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Preface 


What is a fractal? Benoit Mandelbrot coined the term in 1975. There is (or 
should be) a mathematical definition, specifying a basic idea in geometry. 
There is also a figurative use of the term to describe phenomena that approx- 
imate this mathematical ideal. Roughly speaking, a fractal set is a set that 
is more “irregular” than the sets considered in classical geometry. No matter 
how much the set is magnified, smaller and smaller irregularities become visi- 
ble. Mandelbrot argues that such geometric abstractions often fit the physical 
world better than regular arrangements or smooth curves and surfaces. On 
page 1 of his book, The Fractal Geometry of Nature , he writes, “Clouds are 
not spheres, mountains are not cones, coastlines are not circles, and bark is 
not smooth, nor does lightning travel in a straight line’.’ [44, p. 1]* 

To define fractal , Mandelbrot writes: “A fractal is by definition a set for 
which the Hausdorff-Besicovitch dimension strictly exceeds the topological 
dimension’.’ [44, p. 15] It might be said that this book is a meditation on that 
verse. Study of the Hausdorff dimension requires measure theory (Chap. 5); 
study of topological dimension requires metric topology (Chap. 2). Note, how- 
ever, that Mandelbrot later expressed some reservations about this definition: 
“In science its [the definition’s] generality was to prove excessive; not only 
awkward, but genuinely inappropriate . . . This definition left out many ‘bor- 
derline fractals’, yet it took care, more or less, of the frontier ‘against’ Euclid. 
But the frontier ‘against’ true geometric chaos was left wide open.” [44, p. 159] 
We will discuss in Chap. 6 a way (proposed by James Taylor) to repair the 
definition. Mandelbrot himself, in the second printing of [44, p. 459], proposes 
“. . . to use ‘fractal’ without a pedantic definition, to use ‘fractal dimension’ 
as a generic term applicable to all the variants in Chapter 39, and to use in 
each specific case whichever definition is the most appropriate’.’ I have not 
adopted the first of these suggestions in this book, since a term without a 
“pedantic definition” cannot be discussed mathematically. I have, however, 
used the term “fractal dimension” as suggested. 


* Bracketed numbers like this refer to the references collected on p. 257. 
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This is a mathematics book. It is not about how fractals come up in 
nature; that is the topic of Mandelbrot’s book [44]. It is not about how to 
draw fractals on your computer. (I did have a lot of fun using a Macintosh 
to draw the pictures for the book, however. There will be occasional use of 
the Logo programming language for illustrative purposes.) Complete proofs 
of the main results will be presented, whenever that can reasonably be done. 
For some of the more difficult results, only the easiest non-trivial case of the 
proof (such as the case of two dimensions) is included here, with a reference 
to the complete proof in a more advanced text. 

The main examples that will be considered are subsets of Euclidean space 
(in fact, usually two-dimensional Euclidean space); but as we will see, it is 
helpful to deal with the more abstract setting of “metric spaces” 

This book deals with fractal geometry. It does not cover, for example, 
chaotic dynamical systems. That is a separate topic, although it is related. 
Another book of the same size could be written* on it. This book does not deal 
with the Mandelbrot set. Some writing on the subject has left the impression 
that “fractal” is synonymous with “Mandelbrot set”; that is far from the 
truth. This book does not deal with stochastic (or random) fractals; their 
rigorous study would require more background in probability theory than I 
have assumed for this book. 

Prerequisites 

(1) Experience in reading (and, preferably, writing) mathematical proofs is 
essential, since proofs are included here. I will say “necessary and suffi- 
cient” or “contrapositive” or “proof by induction” without explanation. 
Readers without such experience will only be able to read the book at 
a more superficial level by skipping many of the proofs. (Of course, no 
mathematics student will want to do that!) 

(2) Basic abstract set theory will be needed. For example, the abstract notion 
of a function; finite vs. infinite sets; countable vs. uncountable sets. 

(3) The main prerequisite is calculus. For example: What is a continuous 
function, and why do we care? The sum of an infinite series. The limit 
of a sequence. The least upper bound axiom (or property) for the real 
number system. Proofs of the important facts are included in many of the 
modern calculus texts. 

Advice for the reader 

Here is some advice for those trying to read the book without guidance from 
an experienced instructor. The most difficult (and tedious) parts of the book 
are probably Chaps. 2 and 5. But these chapters lead directly to the most 
important parts of the book, Chaps. 3 and 6. Most of Chap. 2 is independent 


For example, [16]. 
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of Chap. 1, so to ease the reading of Chap. 2, it might be reasonable to 
intersperse parts of Chap. 2 with parts of Chap. 1. In a similar way, Chaps. 3, 
4, and 5 are mostly independent of each other, so parts of these three chapters 
could be interspersed with each other. 

There are many exercises scattered throughout the text. Some of them 
deal with examples and supplementary material, but many of them deal with 
the main subject matter at hand. Even though the reader already knows it, I 
must repeat: Understanding will be greatly enhanced by work on the exercises 
(even when a solution is not found) . Answers to some of the exercises are given 
elsewhere in the book, but in order to encourage the reader to devote more 
work to the exercises, I have not attempted to make them easy to find. When 
an exercise is simply a declarative statement, it is to be understood that it is 
to be proved. (Professor Ross points out that if it turns out to be false, then 
you should try to salvage it.) Some exercises are easy and some are hard. I 
have even included some that I do not know how to solve. (No, I won’t tell 
you which ones they are.) 

Take a look at the Appendix. It is intended to help the reader of the book. 
There is an index of the main terms defined in the book; an index of notation; 
and a list of the fractal examples discussed in the text. 

Some illustrations are not referred to in the main text. An instructor who 
knows what they are may use them for class assignments at the appropriate 
times. 

Some of the sections that are more difficult, or deal with less central ideas, 
are marked with an asterisk (*). They should be considered optional. A section 
of “Remarks” is at the end of each chapter. It contains many miscellaneous 
items, such as: references for the material in the chapter; more sophisticated 
proofs that were omitted from the main text; suggestions for course instruc- 
tors. 


Notation 

Most notation used here is either explained in the text, or else taken from cal- 
culus and elementary set theory. A few reminders and additional explanations 
are collected here. 

Integers: Z = {■ ■ ■ , —2, —1, 0, 1, 2, • ■ • }. 

Natural numbers or positive integers: N={1,2,3,---}. 

Real numbers: R. = (— oo, oo). 

Intervals of real numbers: 

( a,b ) = { x : a < x < b} , 

(a, b] = { x : a < x < b } , etc. 

The notation (a, b) also represents an ordered pair, so the context must be 
used to distinguish. 

Set difference or relative complement: X\A = {xGX: x qL A}. 
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If f: X — > Y is a function, and x £ X, I will use parentheses f(x) for 
the value of the function at the point x; if C C X is a set, I will use square 
brackets for the image set f[C] = { f(x) : x £ C}. 

The union of a family (Aj)jgj of sets, written 

U 

iei 

consists of all points that belong to at least one of the sets A, . The intersection 

n* 

ie/ 

consists of the points that belong to all of the sets A t . The family (Aj) iel is 
said to be disjoint iff Ai H Aj = 0 for any i ^ j in the index set I. 

The supremum (or least upper bound) of a set A C R. is written sup A. 
By definition u = sup A satisfies (1) u > a for all a £ A, and (2) if y > a for 
all a £ A, then y > u. Thus, if A is not bounded above, we write sup A = oo, 
and if A = 0, we write sup A = —oo. The infimum (or greatest lower bound) 
is inf A. The upper limit of a sequence (a: n )^L 1 is 

limsupa; n = lim supife. 

n— kx) n — ►oo ^.> n 


And, if a(r) is defined for real r > 0, 

limsupa(r) = lim sup a(r). 

r — >0 s— *0 0 <r<s 

Similar notation is used for the lower limit or lim inf. 

The sign □ signals the end of a proof. 

The origin of the book 

I offered a course in 1987 at The Ohio State University on fractal geometry. 
It was intended for graduate students in mathematics, and it was based on 
the books of Hurewicz and Wallman [35] and Falconer [23]. I tried to keep 
the prerequisites at a low enough level that, for example, a graduate stu- 
dent in physics could take the course. The prerequisites listed were: metric 
topology and Lebesgue measure.* When the course was announced, I began 
getting inquiries from many other types of students, who were interested in 
studying fractal geometry more rigorously, but did not have even this minimal 
background. For example, a student in computer science with a strong back- 
ground in calculus would still have required two more years of mathematics 

* Then I found, to my surprise, that Lebesgue integration is not considered nec- 
essary for physics students. I suppose the fact that I find this incredible is an 
illustration of my ignorance of how mathematics is applied in practice. 
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study ( “Advanced Calculus” and “Introductory Real Analysis” ) before being 
prepared for the course. This book is intended to fit this sort of student. Only 
a small part of those two courses is actually required for the study of fractal 
geometry, at least at the most elementary level. The required topics from met- 
ric topology and measure theory are covered in Chaps. 2 and 5. (Mathematics 
students may be able to skip much of these two chapters.) 

This book is directly derived from notes prepared for use in a course offered 
in 1988 in connection with the program for talented high school students that 
is run here at The Ohio State University by Professor Arnold Ross for eight 
weeks every summer. The influence of these young students can be seen in 
many small ways in the book. (In particular, 1.5.7 and 1.6.1.) Past practice 
in the Ross program suggested the fruitful use of ultrametric spaces. 

Parts of the manuscript were read by Manav Das, Don Leggett, William 
McWorter, Lorraine Rellick, and Karl Schmidt. Their comments led to many 
improvements in the manuscript. I would like to thank the many peo- 
ple at Springer- Verlag New York, especially mathematics editor Rudiger 
Gebauer, mathematics assistant editor Susan Gordon, mathematics editor 
Ulrike Schmickler-Hirzebruch (editor of the “Undergraduate Texts in Mathe- 
matics” series), production editor Susan Giniger, and Kenneth Dreyhaupt. 


Columbus, Ohio 


Gerald A. Edgar 
March 1990 


Second Edition 

Arnold Ross retired from his summer program in 2000 at age 94. He passed 
away on September 25, 2002. But the summer program has continued, with 
Professor Daniel Shapiro as director. 

Although this Second Edition is substantially the same as the First, there 
were many changes. Most important was an increased emphasis on the pack- 
ing measure (Sect. 6.2), so that now it is often treated on a par with the 
Hausdorff measure. The topological dimensions were rearranged (Chap. 3), so 
that the covering dimension is the major one, and the inductive dimensions 
are the variants. A “reduced cover class” notion was introduced to help in 
proofs for Method I or Method II measures (p. 149). Research results since 
1990 that affect these elementary topics have been taken into account. Some 
examples have been removed (golden rectangle, Barnsley wreath). Other ex- 
amples have been added (Barnsley leaf, Li lace, Julia set). Terminology has 
been changed (addressing function, self-referential equation, Lipschitz a, con- 
stituent). Notation has been changed (spectral radius). Most of the figures 
have been re-drawn. 


XII Preface 


Many readers of the first edition of this book provided comments and sug- 
gestions which have improved the second edition. Among these are: Richard B. 
Darst, Manav Das, Paolo Facchi & Saverio Pascazio, Julia Genyuk, Na Peng, 
Lorraine Rellick, John F. Rossi, M. Szyszliowicz, and Yang Ke. Springer has 
been helpful in the preparation, especially pure mathematics editor Mark 
Spencer and editorial assistant Charlene Cerdas. In addition to the soft- 
ware provided with Mac OS X, preparation of the book involved: ASC Logo, 
InkScape, Photoshop, TeXShop. 


Columbus, Ohio 


Gerald A. Edgar 
August 2007 


His days and times are past, 
And my reliances on his fracted dates 
Have smit my credit: I love and honor him 
— W. Shakespeare, Timon of Athens 
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Fractal Examples 


A few basic mathematical examples of fractals will be introduced in this 
chapter. Their analysis, and especially the question of what makes them 
“fractals” must be postponed until much later in the book. 

One of the surprising ideas in the subject is the contention that the “di- 
mension” of a set might be a real number that is not an integer. If we say that 
a set C in the plane has dimension 1.7, then we mean that its properties are 
“between” those of a curve (dimension 1) and an open region (dimension 2). 
The technical aspects of such “fractal” dimensions are discussed in Chap. 6. 

Another characteristic feature of fractals emphasizes their difference from 
the sets of classical geometry. Typically, a fractal looks irregular; but more 
importantly, after it is magnified it still looks irregular. A typical set from 
classical geometry becomes very simple looking if it is magnified enough. One 
of the ways in which this behavior under magnification can be specified is 
through the use of an “iterated function system’.’ Several of the examples in 
this chapter have descriptions in terms of iterated function systems, but the 
more detailed discussion is in Chap. 4. 

1.1 The Triadic Cantor Dust 

We will begin with an example. It will be treated in several different ways. 
Examples that we study later will often have something in common with one 
or more of the aspects treated for this example. This example is known as 
“the Cantor set’.’ Mandelbrot has called it the “Cantor dust” ; this descriptive 
name shows what kind of set it is. (The term “dust” refers to the fact that 
the set is zero-dimensional; this will be discussed in Chap. 3.) 

Construction by Tremas 

The triadic Cantor dust is a subset of the line R. A sequence of approxima- 
tions is first defined. Start with the closed interval Co = [0, 1]. Then the set Ci 
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is obtained by removing the middle third from [0,1], leaving [0, 1/3] U [2/3, 1]. 
The next set C 2 is defined by removing the middle third of each of the two 
intervals of C\. This leaves 

C 2 = [0, 1/9] U [2/9, 1/3] U [2/3, 7/9] U [8/9, 1] . 

And so on. (See Fig. 1.1.1.) The triadic Cantor dust is the “limit” C of 
the sequence C n of sets. The sets decrease: Co 5 Ci 3 C 2 3 • • ■ . So we will 
define the “limit” to be the intersection of the sets, 

C= f]C k . 
ke N 

We will later do other constructions in a similar way. The parts that are 
removed are called tremas. (Mandelbrot coined this word from a Latin word 
meaning “hole’.’ ) 

The sequence of sets is defined recursively. This means that it will often 
be easy to prove facts about the sets by induction. For example, the set C k 
consists of 2 k disjoint closed intervals, each of length (1/3) . So the total 
length of C k , the sum of the lengths, is (2/3 ) k . The limit is 



So the total length of the Cantor dust itself is zero. (The mathematical ver- 
sion of “total length” is called “Lebesgue measure’.’ It will be dealt with in 
Chap. 5). So total length is not a very useful way to compute the size of C . 
We will see that this is related to the fact that the fractal dimension of C is 
strictly less than 1. 

Let us consider more carefully what points constitute the Cantor set. If 
[a, b } is one of the closed intervals that makes up one of the approximations 
C k , then the endpoints a and b belong to all of the future sets C m , m > k, 
and therefore belong to the intersection C. (Again, prove this by induction.) 

Co 


Ci 


C 2 

C 3 

C 4 


Fig. 1.1.1. The Triadic Cantor Dust 
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Taking all the endpoints of all the intervals of all the approximations Ck, we 
get an infinite set of points, all belonging to C. (This set of endpoints is only 
a countable set, however.) 

It is important to note that there are points in C other than these end- 
points. 

Exercise 1.1.2. The point 1/4 is not an endpoint of any interval of any set 
Ck- But the point 1/4 belongs to C. 

It is also important to note that C is not like the usual sets of elementary 
geometry. At first, it is likely to tax your powers of geometrical visualization. 
If you can’t imagine it today, try again tomorrow. Here are a few tidbits to 
help: 

Exercise 1.1.3. (a) The set C contains no interval of positive length. 

(b) The set C has no isolated points: that is, if a £ C, then for every e > 0, 
no matter how small, the interval (a — s, a + e) contains points of C in 
addition to a. 

(c) The set C is closed: that is, if a G R has the property that every interval 
of the form (a — £, a + e) intersects C, then a £ C. 

Exercise 1.1.4. Let r be a positive number. How many tremas are there with 
length > r? 

Coordinates 

There is a convenient way to characterize the elements of the triadic Cantor 
dust in terms of their expansions in base 3. 

First we will review the standard facts concerning base 3. You know how 
expansions in the usual base 10 work. Base 3 is of course similar. Every positive 
integer x has a unique representation 

M 

x = 

j = o 

where the “digits” a,j are chosen from the list 0,1,2. For example, 

15 = 0 x 3° + 2 x 3 1 + 1 x 3 2 . 

We will sometimes write simply 15 = (120)3. It is understood that the sub- 
script specifying the base is always written in base ten. Similarly, we may 
represent fractions: Every number x between 0 and 1 has a representation in 
the form 

-1 

0 =- 00 


X = 
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with digits 0,1,2. These are written with a “radix point” (or “ternary point” 
in this case): 

7/9= (0.21)3, 

1/4 = (0.02020202 • • • )3 (a repeating expansion), 
v / 2 = (1.1020112- --) 3 (non-repeating) . 

Some numbers (rational numbers of the form a/ 3 fc ) admit two different 
expansions. For example, 

1/3 = (0.1000000 • • ■ ) 3 = (0.0222222 • • • ) 3 . 

Proposition 1.1.5. Let x £ [0,1]. Then x belongs to the triadic Cantor dust 
C if and only if x has a base 3 expansion using only the digits 0 and 2* 

Proof. The first place to the right of the ternary point is a 1 if and only if x 
is between 

(0.1000000 •• -) 3 = 1/3 and (0.1222222 ••• ) 3 = 2/3. 

The first trema is the interval (1/3, 2/3). After this trema is removed, we have 
C\. (The numbers 1/3 and 2/3 each have two expansions, one with a 1 in the 
first place, and one without. So they should not be removed.) So C\ contains 
exactly the numbers in [0, 1] that have a base 3 expansion not using 1 in the 
first place. The second place of a number x in C\ is a 1 if and only if x belongs 
to one of the second-level tremas (1/9, 2/9) or (7/9, 8/9). When these tremas 
are removed, we have C 2 . So C 2 contains exactly the numbers in [0, 1] that 
have a base 3 expansion using 1 neither in the first place nor the second place. 
Continuing in this way, we see that the points remaining in C = p| fcgN C k are 
exactly the numbers in [0, 1] that have a base 3 expansion not using 1 at all. □ 

The Cantor dust is uncountable. This follows from the representation just 
proved, together with the observation that each real number has at most 
two representations base 3. (Actually, for numbers in the Cantor dust, two 
different sequences of 0’s and 2’s always represent different real numbers. See 
Exercise 1.6.2.) 

Construction by Translations 

Suppose L is a subset of M and s is a real number. The translate of L by s 
is the set 

{ £ + s : x £ L} . 

* This proposition will show that 1/4 £ C, which is part of Exercise 1.1.2. But I 
hope you already solved it yourself. Was your solution for the special number 1 /4 
simpler than this general case? 
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L 0 • 

Li • 


L 2 • • 

L% • • • 

1 • • < 


Fig. 1.1.6. Translation Construction 


That is, we add s to each element of L. This is sometimes written L + s. 

We construct recursively a sequence {Lk) of subsets of the line R, together 
with a sequence (s^) of real numbers. (Fig. 1.1.6.) Begin with the number 
so = 2/3, and a starting set Lq consisting of the one point 0. The next set 
L\ is obtained by combining Lq with its translate by so- So L\ = {0,2/3}. 
The next term in the sequence (s^) will be 1/3 times the previous one; Si = 
(1/3) (2/3) = 2/9. It will be used to obtain L 2 from L\\ 

L 2 = L l U (Li + si) = {0, 2/9, 2/3, 8/9}. 

Then s 2 = (l/3)si and L 3 = L2U(L2+s 2 )- And so on. The set of interest is the 
“limit” L of the sequence L n . This sequence is increasing: To Q L\ C L 2 C • • • , 
so one reasonable definition for a limit would be the union: 

L= U L fc . 

fceN 

But in fact, we will see later another (and, for our purposes, better) way to 
define the “limit” of a sequence of sets such as {Lk). 

What is the connection between this construction and the previous con- 
struction of the Cantor dust Cl The set Lk consists of 2 fc points. They are 
exactly the left endpoints of the intervals that make up the set Ck ■ Or, they 
are the right endpoints of the tremas removed from [0, 1] to construct Ck (plus 
the one point 0). 

The points of Lk are the numbers in [0, 1] that have a base 3 representation 
with k digits involving only 0’s and 2’s. (Prove this by induction on k.) For 
example, L 2 consists of 


( 0 . 00)3 = 0 , 
(0.02)3 = 2/9, 
(0.20)3 = 2/3, 
(0.22)3 = 8/9. 
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Certainly the set L = (J fceN k is not equal to the Cantor dust C. The 
number 1/4 does not belong to L. But L is “close” to C in the following sense. 

Proposition 1.1.7. If x £ C, then x is the limit of a sequence of points of L. 

Proof. Since x £ C, we know that x has a base 3 representation: 

OO 

x = aj3~i , each a,j = 0 or 2. 

3 = i 

If this representation is truncated after only k terms, we get a number 

k 

Xk — ^ ^ a 0 3 

j-i 

which is an element of Lk . Now 

OO OO 

\x-x k \= a j3~ j < 2 • 3 -J = 3 -fe . 

j—k-\- 1 

Since limfe-.oo 3 _fc = 0, we may conclude that limfc_ >00 Xk = x. Thus x £ C is 
the limit of the sequence Xk £ L. 

The set L is dense in the Cantor set C. This means that L C C, and 
every point of C is the limit of a sequence of points of L. 

Iterated Function System 

Let r > 0 and atl. The dilation on R with ratio r and center a is the 
function / : 1R — > R. given by f(x) = rx + (1 — r)a. 

Consider the two dilations on R defined by: 

/i(X) = |, fo( x ) = • 

They both have ratio 1/3. The first has center 0 and the second has center 1. 

Proposition 1.1.9. The triadic Cantor dust C satisfies the self -referential 
equation 

C = A[C]u f 2 [C]. 


0 



1 


0 1/3 1 


0 



1 


0 2/3 1 


Fig. 1.1.8. Two dilations 
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Proof. It follows by induction that 


c k+1 = Mc k }uf 2 [c k } 


for k = 0, 1, • • ■ . 


First I will prove that C C f±[C] U /^[C]. Suppose x £ C. Then x £ C\. 
So either x £ [0, 1/3] or x £ [2/3, 1], We take the case in which x £ [2/3, 1]; 
the other case is similar. Now for any fc, we know x £ C k + 1 = /i[Cfc] U/ 2 [Cfc]. 
But 


/ilC'fc] C/r[[0,l]] 



so in fact x £ /^[Gt], or 3a; — 2 £ C*,. This is true for all k, so 3x — 2 £ 
DfceN^fe = C. Thus, a: £ / 2 [C] - In the other case, x £ /i[C]. So in any case, 
we have x £ /i [C] U /2 [C] . 

Next I will prove that 


CD/r^U/alC-]. 

Suppose x £ /i [C] U / 2 [C] . Either x £ /i [C] or a ; £ / 2 [C] . We take the case 
x £ /2 [C] ; the other case is similar. Thus, 3x — 2 £ C. Now for any k, we know 
3x - 2 £ C fc , or x £ h[C k ] C C fc+ i. Thus 


x £ p) Cfc+r = H = C - 

/cGN fcGN 


This completes the proof that C3/i [C] U /2 [C] . □ 

We will call the pair (/i, / 2 ) an iterated function system , and we will say 
that C is the* invariant set (or attractor) of the iterated function system 

(/i,/ 2 ). 

Exercise 1.1.10. There are sets 4 / C also satisfying A = fi[A] U f 2 [A]. 
How many can you find? 


1.2 The Sierpinski Gasket 

The next example is a set in the plane known as the Sierpinski gasket. 


Construction by tremas 

Start with a filled-in equilateral triangle with side length 1 (the triangle to- 
gether with the region inside). Call it 5o- It may be subdivided into four 
smaller triangles, using lines joining the midpoints of the sides. The smaller 

* In Chap. 4, extra conditions will be added that will make the invariant set of an 
iterated function system unique, justifying the word “the’.’ 
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triangles have side length 1/2. The middle triangle is rotated 180 degrees 
compared to the others. The trema to be removed is the middle triangle (the 
“open triangle” — remove the interior but leave boundary of the triangle, the 
edges and vertices). After it is removed, the remaining set is Si, a subset of So- 
Now each of the three remaining triangles should be subdivided into smaller 
triangles with edge length 1/4, and the three middle triangles removed. The 
result is S 2 , a subset of Si. We should continue in the same way, to obtain a 
sequence Sk of sets. The Sierpinski gasket is the limit S of this sequence of 
sets. (Fig. 1.2.1.) The sequence is decreasing (So 2 Si D 62 2 • • • ), so by the 
“limit” we mean the intersection S = f\eN 

The set Sk consists of 3 fc triangles, with side 2~ k . So the total area of 
Sk is 3 fe • (2 -fc ) 2 • -v/3/4. This converges to 0 as k — > 00 . The total area of 
the Sierpinski gasket itself is therefore 0. Thus “area” is not very useful in 
measuring the size of S. Area is used to measure the size of a set of dimen- 
sion 2. A line segment, which has dimension 1, has area 0. In a similar way, 
we will see that the Sierpinski gasket S can be said to have dimension less 
than 2 . 

The line segments that make up the boundary of one of the triangles of 
S n remain in all the later approximations Sk, k > n. So the set S contains 
at least all of these line segments. In Sk there are 3 fe triangles, each having 
3 sides of length 2~ k . So the “total length” of S is at least 3 fc • 3 • 2~ k . This 
goes to 00 as k — > 00 . So it makes sense to say that the total length of S is 
infinite. So “length” is not very useful to measure the size of S. Length is used 
to measure the size of a set of dimension 1. A square (with its inside), which 
has dimension 2 , has infinite length, since it contains as many disjoint line 
segments as you like. In a similar way, we will see that the Sierpinski gasket 
S can be said to have dimension greater than 1. 

So S supposedly has dimension greater than 1 but also less than 2. There 
is no integer between 1 and 2. The way around this dilemma, proposed by 
Hausdorff in 1919, is to allow the dimension of a set to be a fraction. According 



Fig. 1.2.1. The Sierpinski gasket 
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to Hausdorff’s definition (Chap. 6), the Sierpinski gasket has dimension ap- 
proximately 1.58. 


Iterated Function System 

Let r > 0 be a real number and let a be a point in the plane. The dilation 
with ratio r and center a is a map / of the plane to itself such that each point 
x is mapped to a point /(x), which is on the ray from a through x, and the 
distance from a to /(x) is r times the distance from a to x. By convention, 
f(a) = a also. (Fig. 1.2.2. If r < 1, as shown, distances decrease; if r > 1, 
distances increase.) 

Exercise 1.2.3. A dilation / maps lines to lines: that is, if L is a line, then 
the set f[L\ = { /(x) : x € L } is also a line. A dilation / preserves angles:* 
that is, if lines L\ and L 2 meet at angle 9 , then lines f[L{\ and /[X2] also 
meet at angle 9. 

The Sierpinski gasket was constructed above using approximations S / ■ Let fi, 
/2, /*3 be the three dilations with ratio 1/2 and centers at the three vertices 
of the triangle Sq. Now it follows by induction that 

Sk+i = fi[Sk]\Jh[Sk]Uf 3 [Sk]. 

Then, in much the same way as in Proposition 1.1.9, we can see that 

S = h[S]U f 2 [S]U f 3 [S] 

by showing each side is a subset of the other. This self-referential equation 
means that S is the invariant set of the iterated function system (/1, /2, / 3). 


Coordinates 

There is a description of the Sierpinski gasket in terms of coordinates. 




Fig. 1.2.2. A dilation of the plane 


We say that / is conformal. 


10 


1 Fractal Examples 



Exercise 1.2.4. Let coordinates (u,v) be defined in the plane with origin at 
one corner of the triangle So, and axes along two of the sides of So- Then 
coordinates (u,v) with 0 < it < 1, 0 < v < 1 represent a point of the Sierpiri- 
ski gasket if and only if the base 2 expansions of u and v never have 1 in the 
same place. 

Another description of the condition on the base 2 expansions of u and v 
is to say that the sum u + v can be computed (base 2) without carrying. You 
will also need to take into account the numbers with two different expansions 
in base 2. 

The preceding exercise may suggest a “translation” type construction for 
the Sierpinski gasket. Start with a single point. Choose two directions, at a 
60 degree angle with each other. Start with a set Lq containing that one 
point, and a number sq = 1/2. The next set is the union of three sets: 
Lq and the translates of Lq through distance Sq in the two chosen direc- 
tions. Then let Si = (l/2)so, and let L 2 consist of L\ together with the 
translates of L\ through distance Si in the two directions. And so on. (See 
Fig. 1.2.7.) 

We say that a plane set L is dense in a set S iff L C S and every point 
of S is the limit of a sequence of points of L. 

Exercise 1.2.6. Prove that the union L = UfceN ^ fc is dense in the Sierpinski 
gasket S. 

Consider Pascal’s triangle: 

1 

1 1 
12 1 
13 3 1 

14 6 

etc. 


4 


1 
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Fig. 1.2.7. Translation construction 



Fig. 1.2.8. Pascal’s triangle modulo 2 


Now, if we make a black dot wherever there is an odd number, and leave blank 
wherever there is an even number, we will get a geometric arrangement in the 
plane (Fig. 1.2.8). 

Exercise 1.2.9. Why does the figure look like the Sierpinski gasket? 


1.3 A Space of Strings 

An infinite binary tree is pictured in Fig. 1.3.1. It is supposed to continue 
indefinitely at the top. Each node has two nodes immediately above it, called 
its children. (Sometimes it might be convenient to distinguish them as the 
left child and the right child.) The node at the bottom, with no parent, 
is called the root of the tree. (Sometimes the tree is drawn the other side 
up; then it looks less like a “tree”, but terminology such as “child” is more 
reasonable.) 

What is a more concrete model of this structure; a model that could be 
used to investigate the properties of an infinite binary tree? 
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We consider two symbols, say 0 and 1. Then we consider finite strings 
(or words) made up of these symbols. For example 

001010011 

The number of symbols in a string a is called the length of the string, and 
written |a|. The string above has length 9. How many strings of length n are 
there? By convention, we say that there is a unique string of length 0, called 
the empty string , which will be denoted A. 

If a and [3 are two strings, then we may form a string a/3, called the 
concatenation, by listing the symbols of the string a followed by the symbols 
of the string (3. 

The set of all such finite strings (from the alphabet E = {0, 1}) can be 
identified with the infinite binary tree: The root of the tree corresponds to 
A; if a is a string, then the left child of a is a0 and the right child of a 
is a 1 . 

We will write E ( n ' > for the set of all strings of length n from the alphabet 
E. (Recall that E^ has one element A.) We will write 

£(*) = E (0) y E (l) y E (2) y . . . 

for the set of all finite strings. Because of the correspondence with the nodes 
of the infinite binary tree, we may sometimes refer to E itself as the infinite 
binary tree. String a represents an ancestor of string f3 in the tree if and only 
if a is an initial segment of (3, that is, /? = ay for some string 7 . Or, a is 
a prefix of /?. We will write a < (3 in this case. If |a| > n, we write a\n for 
the initial segment of a of length n; it consists of the first n symbols of the 
string a. Thus, (001010011) (4 = 0010. 
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Let A be a subset (finite or infinite) of the infinite binary tree E^*\ A 
node 7 £ is a lower bound for the set A iff 7 < a for all a £ A. (When 
we think of strings, we might say that 7 is a “common prefix” for the set A.) 
A node (3 is a greatest lower bound for the set A iff f3 is a lower bound for 
A and 7 < /? for any other lower bound 7 for A. (In string language, /3 is the 
“longest common prefix” for the strings in the set A.) 

Proposition 1.3.2. Every nonempty subset A of the infinite binary tree E^ 
has a unique greatest lower bound. 

Proof. First, choose some 7 £ A. This is possible since A is not empty. Let 
n = I7I be the length of 7. Consider the integers k with 0 < k < n. Some 
of them, for example 0, have the property that yffc is a lower bound for A. 
Let k 0 be the largest such k. (A finite nonempty set of integers has a largest 
element.) I claim that yffco is the desired greatest lower bound. 

First, yffcg is a lower bound for A. Let (3 be any other lower bound for A. 
Now 7 £ A, so j3 < 7. That means (3 = yffc for some k with 0 < k < n. By the 
definition of fcg, we know that k < k^. So (3 < yf/cg. Thus yffcg is the greatest 
lower bound of A. 

If a and (3 are both greatest lower bounds of the set A, then each is < 
the other, so they are equal. □ 

Next, let E l 'A) be the set of infinite strings from the alphabet E. For 
a g £;(“) there are initial segments erfn of all sizes; we can think of an infinite 
string tr in terms of its initial segments, beginning with the empty string A: 

A = erfO 

< erfl 

< <7 (2 

< • . 

Another way to describe an element cr £ E^ is as an infinite sequence of 
letters from E = {0, 1}. (The Greek letter omega, w, is used in set theory to 
represent the least infinite ordinal. Here is shows the order type that is used 
for our infinite strings.) 

If cr £ and a £ E^*\ then concatenation aer £ E^ still makes sense. 
If o £ E and n £ N, then the prefix o\n £ E makse sense. 

We will be interested in certain subsets of E^ u \ If a £ £(*) is a finite 
string, let 

[a] = | a £ E^ : a < <7 | , 

the set of all infinite strings that begin with a. We may call this the cylinder 
defined by a. Now the two children of a are aO and al. The corresponding 
sets satisfy 


[a] = [aO] U [al], [aO] D [al] = 0. 
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We can think of the usual base 2 system as defining a function h from 
strings to real numbers, by adding a binary point on the left. For example, 
the periodic string a = 001001001 • ■ • corresponds to the real number 

h{a) = (0.001001001- • • ) 2 = y. 

The set h of values of h is exactly [0, 1]. 

I have made assertions before about when two strings correspond to the 
same real number. Can you prove them? 


Exercise 1.3.3. What are necessary and sufficient conditions on infinite 
strings a, t £ E ^ so that h(a) = h(r )? 


A situation like the one just described will be called a “string model”: 
the set of interest, such as [0, 1], is related to a set E ^ of strings, called the 
“model” by a function, such as h: E^ — > R, called the “model map’.’ The 
model can be used to study the set of interest. If the model map h is un- 
derstood, sometimes we may say that the infinite string a is the address of 
the point h(a). Alternate terminology calls E^ a code space and h the 
addressing function. 

Here is a second example of a string model. The set of interest is the 
triadic Cantor dust C. The space of strings is again the set E ^ of infinite 
strings from the two-letter alphabet {0, 1}. But now the map h: E^' 1 — > R is 
slightly different. Basically what we want to do is to have the letters 0 and 1 
correspond to digits 0 and 2 respectively, and write the numbers in base 3. 

So, for example, the periodic string a = 001001001 ■ • • corresponds to the 
real number 

h{a) = (0.002002002 • • • ) 3 = 

JLO 

According to Proposition 1.1.5, the range of h is exactly the Cantor dust: 
h [E(“)] = C. The model map h is related to the two dilations associated with 
C above: 

Exercise 1.3.4. Let h: E^ — ► R be the model map just defined. Then for 
all strings cr £ E^\ 


h(0a) 

h{l(j) 


h (°) 

3 

h(a) + 2 
3 


1.4 Turtle Graphics 

Many of the examples we will discuss are defined recursively. At least the finite 
approximations to the sets can be drawn by a computer graphics program. 


1.4 Turtle Graphics 
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When it seems useful, the Logo programs for the sets will be included in the 
book. In this section, the few elements of the Logo language are discussed. 
The main part of Logo that we will be concerned with is drawing with “turtle 
graphics’.’ 

Other computer languages could be used in place of Logo. The main re- 
quirement is the existence of graphics commands. Abelson and diSessa [1, 
Appendix B] discuss how to implement turtle graphics in other languages 
(BASIC, Pascal, APL, Lisp, Smalltalk). For example, in Pascal we would de- 
fine routines FORWARD, BACK, LEFT, RIGHT, and then use the Logo programs 
with appropriate changes in syntax (parentheses and commas). 


Logo 

We should think of drawing in the plane, as represented by the computer 
screen. Our drawing instrument, known as a turtle, is pictured in Fig. 1.4.1(a). 
(On some versions of Logo, it may be simplified to a triangle.) Its properties 
include a “position” (a point in the plane) and a “heading” (the direction the 
turtle faces). 

The commands forward and back make the turtle move. The argument 
is the distance to move, measured in some convenient units. As it moves, the 
turtle draws a line. 

The commands left and right turn the turtle (change the heading). 
The angle is measured in degrees. 

Between penup and pendown, no drawing occurs. 

The command repeat can be used for repetition. Its first argument is the 
number of times to repeat, and the second argument is the list of commands 
to be repeated. 

Variables can be used to store values. A colon preceding the name of the 
variable means that we want to refer to the value of the variable. A double- 
quote preceding the name of the variable means that we want to refer to the 
name of the variable itself. One of the Logo assignment statements is make. 

To assign a value to the variable, make needs to know its name, not its old 




Fig. 1.4.1. (a) Turtle (b) forward 50 
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Fig. 1.4.1. (c) right 90 





Fig. 1.4.2. 

(e) forward 50 





value. But on the other hand, repeat needs to know the value 5, not the name 
of the variable. 

make "n 5 

repeat :n [forward 50 left 360/:n] 

The usual arithmetic can be performed: addition :x + :y, subtraction 
:x - :y, multiplication :x * :y, division :x / :y, square root sqrt :x. 
Some versions of Logo also have powers : x A : y. 

Commands can be combined to make new commands. Here is a definition 
of a command polygon. It will have as arguments the length of each side, and 
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the number of sides. The key words to and end show such a definition. (Some 
versions of Logo do not use to and end. but have other methods of defining 
commands.) 

to polygon :size :n 

repeat :n [forward :size left 360 / :n] 

end 

After polygon has been defined, it may be used like any other com- 
mand (Fig. 1.4.5). Polygons with more and more sides (of shorter and shorter 
lengths) converge to a circle. (Convergence for sets is discussed in Sections 2.3 
and 2.4.) Since the graphics screen of the computer (and the ink on this page, 
as well) has only finite resolution, drawing a regular polygon with enough 
sides is the same as drawing a circle. 

Exercise 1.4.6. Write an ellipse program, using only the Logo commands 
discussed above. 

The first argument of the if command is a condition to be tested. If it is 
true, then the second argument, a list of commands, is executed. If not, the 
third argument is executed. The stop command ends the execution of the 
routine. Control returns to wherever the routine was called from. Recursive 



Fig. 1.4.5. (a) polygon 50 7 (b) polygon 1 360 
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routines will be used frequently. In the following example, the routine spiral 
calls itself to draw a spiral at a smaller size. The if condition insures that it 
does not run indefinitely. 

to spiral :size 

if :size < 1 [stop] [ 
forward :size 
right 60 

spiral :size * 0.8] 

end 


1.5 Sets Defined Recursively 

We will consider several sets that are defined recursively. Some of the ways 
used for defining the triadic Cantor dust and the Sierpiriski gasket are recur- 
sive. Most of the examples in this section are “dragon” curves. They can be 
described well using a recursive Logo program. 


The Koch Curve 

The first construction is of “trema” type. (Fig. 1.5.1.) It begins with a tri- 
angle Lq (including the interior) having angles of 120, 30, 30 degrees. This 
triangle can be subdivided into three smaller triangles: two isosceles trian- 
gles, angles 120, 30, 30, with long sides along the short sides of the orig- 
inal triangle; and one equilateral triangle. For the next approximation L i, 
the trema to be removed is the equilateral triangle. Then we repeat, again 
and again. The Koch curve is the “limit” (Why is it called a “curve”? See 
Proposition 2.4.10.) 

A second construction for the Koch curve is of “dragon” type. It involves 
approximations that are polygons. Here is a program in the Logo language. 
This is a good example of a “recursive program’.’ The procedure Koch calls 
itself four times to draw four copies of the curve at 1/3 the size. But each of 
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Fig. 1.5.1. Koch curve 


these four calls of Koch calls four more. And so on. The variable depth is used 
to end the recursion at a certain number of levels. 

to Koch : depth :size 

if : depth = 0 [forward :size stop] [ 

Koch : depth - 1 :size / 3 
left 60 

Koch : depth - 1 :size / 3 
right 120 

Koch : depth - 1 :size / 3 
left 60 

Koch : depth - 1 :size / 3] 

end 

The first set P 0 is a line segment: its picture (Fig. 1.5.2) is obtained by 
executing Koch 0 200. The value 200 is simply a convenient size. The next 
approximation Pi is obtained by executing Koch 1 200. And so on. The set 



Fig. 1.5.2. Results of program Koch 
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Fig. 1.5.3. Snowflake 


Pk consists of 4 fe line segments of length 3 -fe . The Koch curve is the “limit” 
P of the sequence Pk- 

The Koch curve can be obtained as an iterated function system construc- 
tion. For example, it is made up of 4 parts, each similar to the whole. See 
Plate 8. (Similarities are discussed in Sect. 2.1.) 

Three copies of the Koch curve, originating from three sides of an equi- 
lateral triangle, form a simple closed curve, often known as the snowflake 
curve (Fig. 1.5.3). 

Heighway’s Dragon 

(See Fig. 1.5.4.) Heighway’s dragon is a set in the plane. The approximation 
Pq is a line segment of length 1. The next approximation is P±; it is obtained 




Fig. 1.5.4. Heighway’s dragon 
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from P 0 by replacing the line segment by a polygon with two segments, each 
of length 1 / a/ 2, joined at a right angle. The two ends are the same as be- 
fore. (There are two choices of how this can be done. We choose the one on 
the “left” side.) For P 2 , each line segment in Pi is replaced by a polygon 
with two segments, each having length l/y/2 times the length of the segment 
that is replaced. The choices alternate between left and right, starting with 
left, counting from the endpoint on the bottom. Heighway’s dragon is the 
“limit” P of this sequence P n of polygons. 

The program below is shorter than the description given above, and (with 
a little study) is less prone to be misinterpreted.* This is a good reason for 
using Logo to describe the construction. 

; Heighway’s Dragon 

make "factor 1 / sqrt 2 

to heighway : depth :size : parity 

if : depth = 0 [forward :size stop] [ 
left : parity * 45 

heighway : depth - 1 :size * : factor 1 
right : parity * 90 

heighway : depth - 1 :size * : factor (-1) 
left : parity * 45] 

end 

The third argument is supposed to be either 1 or —1, depending on whether 
the next corner is supposed to go to the left or to the right. To generate Po, 
execute heighway 0 200 1. To generate Pi, execute heighway 1 200 1. 

Proposition 1.5.5. All of the approximations P n remain in some bounded 
region of the plane. 

Proof. Every point of Po has distance at most 1 from the endpoint. Every 
point of Pl has distance at most 1/2 from some point of Po- We can see by 
induction, that every point of Pk has distance at most (1/V2) fc+1 from some 
point of Pk- 1 - Therefore, any point of Pk has distance from the endpoint at 
most 



This is a geometric series with ratio < 1, so it converges to a finite value. □ 

The following exercise may be easier to approach after the discussion of 
similarities (Sect. 2.1) and convergence of a sequence sets (Sects. 2.4 and 2.5). 

Exercise 1.5.6. There are two similarities /i and / 2 of the plane onto itself, 
with ratio 1 / x/2, so that Heighway’s dragon P satisfies the equation P = 
/i[P]U/ 2 [P]. 

* Did you figure out what I was saying about “left” and “right”? 
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Proposition 1.5.7. In an approximation Pk of Heighway’s dragon , the poly- 
gon never crosses itself. 

Proof. It is possible that Pk visits some point twice. But I will show that it 
does not traverse any line segment twice, and when it visits a point twice, it 
does not cross itself. The vertices of the polygon Pk lie on a square lattice Lk 
with edge length (1/V2) fc . Suppose Pk visits a point twice without visiting 
an entire line segment twice; then that point must be a vertex of the square 
lattice. At each vertex, the polygon has a right-angle corner. So it does not 
cross itself there. 

I must only show that Pk does not include a line segment more than once. 
If it does, it must include a complete edge of the square lattice. Let k be an 
integer such that Pk does not include any line segment more than once. Let 
S be some square of the square lattice Lk- I will show that Pk+i does not 
include more than once any of the four line segments of Lk+\ inside S. 

Let t be an edge of Lk+i inside S. If t were traversed by Pk+i more than 
once, then the two sides of S adjacent to t (ei and e 2 in the figure) must both 
be traversed by Pk- Now let us color the squares of Lk in a checker-board 
pattern. The square to the left of the first line segment of Pk will be colored 
black, and the squares will alternate white and black. Since there is a right- 
angle turn between two consecutive segments of Pk, the square to the left is 
always the black square. By convention, let us say that the first line segment 
is “vertical’.’ The edges alternate between vertical and horizontal. Now when 
Pk+i is constructed, the new pairs of edges are placed alternately to the left 
and to the right of the old edges. So the new edges corresponding to a vertical 
edge of Pk will be in the black square bordered by the edge, and the new 
edges corresponding to a horizontal edge of Pk will be in the white square 
bordered by the edge. So not both of the edges e\ and e 2 will produce new 
edges inside S. □ 



Fig. 1.5.7. (a) Square Lattice. (b) The Square S 


1.5 Sets Defined Recursively 


23 


Fudgeflake 

Using an angle other than 90 degrees, we can obtain variant forms of Heigh- 
way’s dragon. In Fig. 1.5.8, the angle is 120 degrees. (For this version, left and 
right have been reversed). Three copies of this curve, joined in an equilateral 
triangle, surround a set known as the fudgeflake. A fudgeflake is made up of 
3 small fudgeflakes; so the fudgeflake tiles the plane (p. 39).* 

Sierpinski dragon 

Here is another dragon. (See Fig. 1.5.9.) 






Fig. 1.5.8. 


120-degree dragon and fudgeflake 



Fig. 1.5.9. Sierpinski dragon 


A set with fractal boundary that tiles the plane is known as a fractile. 
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to SD : depth :size : parity 

if : depth = 0 [forward :size stop] [ 
left 60 * : parity 
SD : depth - 1 :size / 2 (-: parity) 
right 60 * : parity 
SD : depth - 1 :size / 2 : parity 
right 60*: parity 

SD : depth - 1 :size / 2 (-: parity) 
left 60 * : parity] 

end 

Exercise 1.5.10. What does this dragon construction have to do with the 
Sierpinski gasket? 

McWorter’s Pentigree 

Fig. 1.5.11 illustrates another dragon; we will call it McWorter’s pentigree. 
It is a subset of the plane. The first approximation Pq is a line segment. In 
any future stage P n , each line segment of P n is replaced by six line segments 
in a particular pattern to form P n+ 1. Here is a Logo program. Why was the 
value (3 + \/5)/2 chosen for the value of the variable shrink? 

; McWorter’s pentigree 
make "shrink (3 + sqrt 5) / 2 
to pent : depth :size 

if : depth = 0 [forward :size stop] [ 
left 36 

pent : depth - 1 :size / : shrink 
left 72 

pent : depth - 1 :size / : shrink 





Fig. 1.5.11. McWorter’s Pentigree 
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right 144 
pent : depth 
right 72 
pent : depth 
left 72 
pent : depth 
left 72 
pent : depth 
right 36] 

end 


1 :size / : shrink 
1 :size / : shrink 
1 :size / : shrink 
1 :size / : shrink 


Exercise 1.5.12. Does an approximation P n for McWorter’s pentigree ever 
cross itself? 

Five copies of the pentigree fit together to form a set with five-fold rota- 
tional symmetry (Plate 1). This set will also be called “the second form of 
McWorter’s pentigree’.’ It can be thought of as made up of 6 sets similar to 
the whole, with ratio 2/(3 + y/E) = (3 — y/E)/2 (Plate 2). 

Consider the “translation” construction illustrated in Fig. 1.5.13. Set Lq is 
a single point. Set L\ is obtained from Lq by translating Lq in 5 equally spaced 
directions by some distance So together with Lq itself. Set L 2 is obtained from 
L i by translating it in 5 directions (the opposites of the previous directions) 



Fig. 1.5.13. Translation Construction 
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by distance si = rso, where r — (3 — \/5 ) /2. Set L 3 is obtained by translating 
L 2 in the original 5 directions by distance S 2 = rsi. And so on. 

Exercise 1.5.14. What does this construction have to do with McWorter’s 
pentigree? 

Barnsley’s Leaf 

Here is a set defined recursively in Logo form. If : depth is large enough so 
that you can watch it being drawn, you may see that most line-segments are 
re-traced many times. Some results are shown in Fig. 1.5.15. The limit B of 
these approximations will be called Barnsley’s leaf. 

make "stwo sqrt 2 
to leaf : depth :size 

if : depth < 1 [forward 2 * :size back 2 * :size] [ 
forward :size 
leaf : depth - 2 :size / 2 
back :size 
left 45 

leaf : depth - 1 :size / : stwo 
right 90 

leaf : depth - 1 :size / :stwo 
left 45] 

end 



Fig. 1.5.15. Barnsley’s leaf construction 
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Notice that one recursive copy is shrunk by factor 1/2, and two copies are 
shrunk by factor l/y/2. 

Exercise 1.5.16. Determine three similarities /i, //, / 3 of the plane into itself 
so that Barnsley’s leaf B satisfies the self-referential equation B = fi[B] U 
f 2 [B\Uf 3 [B\. 

A Julia Set 

The maps in an iterated function system used to construct a set need not be 
similarities. Write C for the set of complex numbers, geometrically thought of 
as a Euclidean plane. We briefly discuss the Julia set for a function p: C — > C 
given by <p(z) = z 2 + c. Here, c is a fixed complex number. (In the pictures, 
I used c = —0.15 + 0.72i because I liked the look of the result.) The map ip 
has inverse / defined by f(z) = y/ z — c. Actually, a complex number (except 
zero) has two square-roots, so there are two inverses, say fo(z) = y/z — c and 
fi(z) = — y/z — c. It won’t matter how we choose one of the two square-roots 
for fo because we will use them both. The Julia set in this case is a nonempty 
compact set J C C that satisfies the self-referential equation* 

J = fo[J] U fi[J]- 

A construction for J will work much as in the previous examples. See 
Fig. 1.5.17. Start with a set J 0 . (I used a parallelogram connecting four points 
which turn out to be the extremities of J.) Then recursively define 

Jn + 1 = /ot^n] U fl[Jn]‘ 

The limit of the sequence J n is J. The limit of a sequence of sets will be 
discussed in Sect. 2.5. 

Another possibility starts with J 0 as the circle \z\ = 2. This was chosen 
so that the image Ji is a curve that lies inside that circle. And subsequent 
images J n will curves, each be inside its predecessor. Plate 14 shows all of 
these drawn in the same picture, with the bands between successive curves 
colored in different colors. Points inside all the curves are black, this is the 
filled-in Julia set. This picture is the “escape time” coloring often seen (for 
example [3, plate 13]) but this time constructed using the inverses of ip. 


Reflections in a Circle (on a Sphere) 

Some fractal examples lie on the surface of a sphere. To represent them in 
drawings on a flat page, we sometimes use “stereographic projection” to set 

* To get uniqueness, we exclude the point c where the two square-roots coincide, 
thus requiring that J C C \ {c}. Otherwise, the filled-in Julia set would be a 
second possibility. Compact sets are discussed in Sect. 2.3. 
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Fig. 1.5.17. Julia construction 

up a correspondence between the sphere £ and a plane 77. In Fig. 1.5.18, 
the horizontal plane 77 is tangent to the “south pole” of the sphere £. Each 
point S on £ (except the north pole itself) together with the north pole N 
determines a line NS that meets 77 in a unique point P. The correspondence 
S * — * P is called stereographic projection. 



Fig. 1.5.18. Stereographic projection 
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Fig. 1.5.20. Reflection in a circle 


Exercise 1.5.19. Under stereographic projection, a circle C on the sphere E 
maps to either (1) a circle on 17 or (2) a line on 77. 

Maps of the sphere that are easy to visualize, such as rotations about a 
diameter, may not be as easy to visualize as maps of the plane. And vice 
versa. If C is a circle on E, we want to define a map of E to itself described as 
“reflection in the circle” C. When C is a great circle, that is easy to imagine 
as a map of E to itself. When the circle passes through the north pole TV, then 
reflection in C becomes reflection in a line in 77, which is easy to imagine. 
For a general circle we may proceed as follows: rotate the sphere to map 
the circle C to a circle C through TV. Then reflect in C' which is reflection 
in a line in the stereographic projection. Finally, rotate back to the starting 
orientation. 

Exercise 1.5.21. Let the set C of complex numbers be identified with the 
plane 77 in such a way that the equator of E corresponds to the “unit circle” 
K = {z £ C : |z| = 1 } of the complex plane. The reflection in K corresponds 
to reflection in the equator. Show that this reflection, in terms of complex 
numbers, is described by 

1 

z e- > — — . 
z 

Of course the point z = 0 corresponds to the south pole, and that reflects 
to the north pole, so the image of 0 under the reflection in K is not a point 
of C. 

Figure 1.5.22(a) depicts Pharaoh’s breastplate , which is a fractal con- 
structed on a sphere. First we choose circles (called “generators”) as in 
Fig. 1.5.22(b): six circles, each tangent to four of the others. The picture 
is the stereographic projection — the dashed circle is the equator (not one of 
the six); the vertical line down the middle is of course a circle through the 
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Fig. 1.5.22. (a) Pharaoh’s breastplate (b) Generators (c) Osculating basis 


north pole. These six circles define six reflection maps fi, - ■ ■ , of the sphere 
to itself. The fractal P is the smallest nonempty closed* set invariant under 
all six of these reflections: P = fi[P] = f 2 [P] = ■ ■ ■ = fe[P ]■ 

There is an efficient way to draw the fractal. We define an “osculating ba- 
sis” of eight disks, each with bounding circle orthogonal to three of the gener- 
ators and not meeting the other three generators. In Fig. 1.5.22(c), these disks 
are in white. One of them is the northern hemisphere (in the stereographic 
projection, the outside of the unit circle). 

The first approximation Pq is the entire sphere. The first tremas Uq to 
be removed are the open disks of the osculating basis. This leaves the the 
approximation Pi. The next tremas to be removed are the images of Uq under 
the six reflections. (Some of the images are repeats.) This leaves the set P 2 . 
We continue in this way recursively, and the limit P is the intersection of the 
approximations P n . 

A colorful image is contracted as follows. Color the disks of the osculating 
basis in up to eight colors, then when you take an image of a trema, use the 
same color. The complement of P will then be colored in the colors chosen. 
See Plate 9. 



Fig. 1.5.23. Construction by tremas 


Closed set is defined on p. 47. 


1.6 Number Systems 
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1.6 Number Systems 


Let us consider the usual decimal representation of real numbers, and how it 
can be generalized. Suppose we have a number 6 for the base (or “radix”) of 
our number system, and a finite set D = {di, c? 2 , • • • , dk} of numbers, called 
“digits’.’ We will always assume that 0 is one of the digits. A “whole number” 
for this system will have the form 

M 

E®^ (!) 

J= o 

where each aj £ D. Write W for the set of whole numbers. A “fraction” for 
this system will have the form 

E (2) 

j=~ oo 

where each a,j G D. Write F for the set of fractions. The general number 
represented by this system is the sum of one of each: 

M 

E a $- ( 3 ) 

j=-oo 

In order for the representations (2) to converge, we must have |6| > 1. 
Our usual decimal number system has 

6=10 and D = {0,1, 2, 3, 4, 5, 6, 7, 8, 9}. 


The binary number system has 


6 = 2 and D = {0,1}. 


In these two cases, the numbers of the form (1) are exactly the nonnegative 
integers, and the numbers of the form (2) are the elements of the interval [0, 1]. 
So the numbers of the form (3) are exactly the nonnegative real numbers, 
[0,oo). In both of these cases, there are numbers that have no representation 
in the form (3), namely the negative numbers. 

Exercise 1.6.1. Let 6 = —2 and D = {0,1}. The numbers of the form (1) 
are exactly the integers. The numbers of the form (2) constitute the closed 
interval [—2/3, 1/3]. Every real number has the form (3). 

Both the decimal and binary systems have the fault that some numbers 
have two different representations. For example 


o x io _i + E 9 x w ~ j = 1 x io_1 + E 0 x 10_J - 

J= 2 3=2 

A system related to the Cantor set avoids this fault. 
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Exercise 1.6.2. Let 6=3 and D = {0, 2}. Then no number has two different 
representations in the form (3). 

But in this case, some numbers have no representation at all (such as 1/2). 
Can we avoid both problems at once? 

Exercise 1.6.3. Let 6 be a real number with |6| > 1, and let D be a finite set 
of real numbers including 0. Then either some real number has no expansion 
in the form (3) or some real number has more than one expansion in the 
form (3). 

Next, let us consider a number system to represent complex numbers. Now 
the base 6 may be a complex number, and the digit-set D is a finite set of 
complex numbers (including 0). We are interested in representing complex 
numbers in the form 

M 

a o y ' ( 3 ) 

j=-oo 

where all a,j G D. 

Exercise 1.6.4. Let 6 be a complex number with |6| > 1, and let D be a 
finite set of complex numbers including 0. Then either some complex number 
has no expansion in the form (3) or some complex number has more than one 
expansion in the form (3). 

Exercise 1.6.5. If every complex number has an expansion of the form (3), 
then in fact there is a complex number with at least three different expansions 
in the form (3). 

One useful property that a number system for complex numbers might 
have is that the set W of whole numbers is the set of “algebraic integers” of a 
number field.* Two examples: The complex numbers of the form u + iv, where 
u and v are integers; they are called the Gaussian integers. For a second 
example, let 

—1 + i-\/3 2-7T 2-7T 

u> = = cos b i sin — . 

2 3 3 

Note that tu 3 = 1, and ui 2 = uJ = l/u. The complex numbers of the form 
u + vu> ( u and v integers) are sometimes known as the Eisenstein integers J 


Exercise 1.6.6. Let b = —1 + i and D = {0, 1}. Describe the set W of whole 
numbers (1) for this number system. 


An algebraic integer is a complex number that is a zero of a polynomial such that 
the coefficients are integers and the coefficient of the highest-degree term is 1. The 
number held Q(i) is made up of the complex numbers of the form u + vi, where 
u and v are rational numbers. It turns out that such a number is an algebraic 
integer if and only if u and v are both integers. 

They are the algebraic integers of the number held Q(a/— 3 ). 
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Fig. 1.6.7. Twindragon Construction 


A construction of the set F of “fractions” for the number system with base 
b = — 1 + i and digit set D = {0,1} is illustrated in Fig. 1.6.7. For the first 
approximation, we consider the set W of “whole numbers” in this case the 
Gaussian integers; they form a regular square lattice So in the complex plane. 
The points that are closer to 0 than to any other Gaussian integer form a 
square. Call this square Lq. Next, consider the numbers representable in our 
system using at most one place to the right of the radix point. They, too, form 
a square lattice Si in the complex plane, but with shorter sides, and at an 
angle. The set of points closer to 0 than to any other element of Si is a square, 
and the set of points closer to (.1) _i+i than to any other element of Si is also 
a square. These two squares constitute a set L \ , the next approximation of 
F. When we take two digits to the right of the radix point, we get a set L 2 
made up of four squares. Continuing in this way, we obtain F as the limit of 
a sequence (Lk) of approximations. 



Fig. 1.6.8. Twindragon 
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The set obtained in this way is the twindragon. Plate 4 suggests that it is 
made up of two copies of Heighway’s dragon. Since every complex number can 
be represented in this number system, the plane is covered by countably many 
twindragons, namely the sets w + F, one for each Gaussian integer w. The 
sets w + F overlap only in their boundaries, so this constitutes a tiling of the 
plane. The twindragon (we will see later) has a fractal boundary. Therefore 
the twindragon is a “fractile” 

This set F can be seen from the point of view of an iterated function 
system. The set F is the union of two parts, namely the set Fq of all numbers 
of the form (2) with a_i = 0, and the set F\ of all numbers of the form 
(2) with a_i = 1. Now the elements of Fq are exactly 6 -1 times the elements 
of F\ and the elements of F\ are of the form & _1 + b~ 1 x, where x £ F. So if 
we write 


fo(x) = b 1 x, 
fi(x) = b~ l + b~ 1 x, 

then we have a self-referential equation 


F = f 0 [F\Uf 1 [F\. 

So F is the invariant set for this iterated function system. See Plate 5. 

Exercise 1.6.9. Find a complex number with three representations in this 
system. 

Let us turn next to the Eisenstein number system. 

Exercise 1.6.10. Let b = —2 and D = {0, l,w,u) 2 }. Describe the set W of 
whole numbers. 

For calculations in this system, it may be easier to write A = u> and 
B = w 2 . The set F of fractions is pictured in Fig. 1.6.11. See also Plate 6. 



Fig. 1.6.11. Eisenstein fractions 
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Exercise 1.6.12. Describe an iterated function system for the Eisenstein 
fractions. 

The Eisenstein fractions make up another “fractile’.’ 


1.7 * Remarks 

Fractal sets, such as those we have seen in this chapter, have been used by 
mathematicians over the years. Only since Mandelbrot’s book has there been 
interest in them beyond mathematics. Even among mathematicians, these sets 
had often been considered to have little interest. 

I recall a discussion I had with one of my colleagues some years ago. We 
were talking about a problem dealing with abstract integration. In the course 
of the discussion, it became apparent that I considered the Cantor set a more 
natural setting for the problem than the interval. The colleague was surprised 
by that. He had thought that the Cantor set was only a pathological coun- 
terexample. I wonder what he would have said about the Sierpinski gasket. 

In 1883, Georg Cantor published a description of the set that today bears 
his name. It is known as the “Cantor discontinuum” the “Cantor middle- 
thirds set” or simply the “Cantor set’.’ Over the years it has come to occupy 
a special place at the heart of descriptive set theory. It was the most important 
example used by Hausdorff in his paper [32] on fractional dimension. 

The “iterated function system” was named and popularized by Michael 
Barnsley [2], [3]. 

Waclaw Sierpinski published his description of the “Sierpinski gasket” 
in 1915. This name was assigned to it by Mandelbrot. The pre-Mandelbrot 
literature calls it something like “Sierpinski ’s other curve’.’ (It was called 
a curve, despite its appearance, because it has topological dimension 1; see 
Chap. 3. “Sierpinski’s curve” was already used to refer to another example, 
called Sierpinski’s carpet — dywan Sierpiriskiego [44, p. 144].) 

The use of “strings” in this book may be a bit unusual. The more con- 
ventional terminology involves “sequences” from the set {0, 1}, so there is no 
essential difference. Many students will be familiar with strings from their 
computer-related courses (but probably not infinite strings). Sequences are 
used in other ways in this book, so I hope this terminology will reduce the 
confusion a little. 

The description of dragon curves is done naturally using recursive com- 
puter programs. Instead of using a pseudo-code to formulate such programs, 
or (worse yet) inventing another ad-hoc notation, I have chosen to use a real 
computer language. One of my favorite languages, especially for drawing pic- 
tures, is Logo. Some programmers have negative opinions concerning Logo, 
because it has been used to teach small children. But that should not be held 
against it. There are programming tasks that I would not use Logo for, but the 
simple recursive pictures that are of concern in this book seem suited to Logo. 
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A reference for turtle graphics is [1]. It contains some interesting ideas on 
plane geometry. It contains a solution for Exercise 1.4.6: 

to ellipse :s :e 
make "n 1 

repeat 360 [right :n forward :s left :n 
left :n forward (:s * :e) right :n 
make "n :n + 1] 

end 

If you show the turtle, and ask him to do ellipse 1 0.5, you will see that 
he is doing a lot of work.* 

H. von Koch’s curve dates from 1904. It is a continuous curve that has a 
tangent line nowhere. The closed “snowflake” version of Fig. 1.5.3 is sometimes 
used as an example of a curve of infinite length surrounding a finite area. We 
will see later that the snowflake curve has fractal dimension strictly larger 
than 1. This is a much more precise assertion than merely saying the curve 
has infinite length. 

Heighway’s dragon dates from about 1967; according to Martin Gard- 
ner [28], it was discovered by physicist John E. Heighway, and studied by 
Heighway together with physicists Bruce A. Banks and William G. Harter. 
This dragon was publicized in [13], which contains a wealth of information 
about the polygonal approximations. Proposition 1.5.7, which states that a 
polygonal approximation does not cross itself, was observed by Harter and 
Heighway; a proof was published by Davis and Knuth [13] . The proof of Propo- 
sition 1.5.7 that appears here contains elements of a proof submitted by Brian 
Conrad, Jon Grantham, and Roger Lee during the Ross summer program. 
The name “Heighway” is spelled “Heightway” in some of the references on 
the subject. 

The fudgeflake is found in [44, p. 72]. Mandelbrot considered this shape 
to be derived by “fudging” the snowflake (Fig. 1.5.3) using alternating left 
and right. William McWorter described the pentigree in [50]. “Pentigree” 
is from “pentagon-filligree’.’ The first edition of this book contains the first 
published analysis of this interesting dragon. Slightly changing the definition 
yields another interesting result, the pentadendrite (see Plate 3). 

make "off angle 11.82 

make "shrink 1 / 2.87 

to dend : depth :size 

if : depth = 0 [forward :size stop] [ 
left : off angle 

dend : depth - 1 :size * : shrink 
left 72 

dend : depth - 1 :size * : shrink 
right 72 


* This was true in 1990. But nowadays computers may be so fast that you won’t 
see this. 
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dend : depth - 
right 144 

1 

: size 

* 

: shrink 

dend : depth - 
left 72 

1 

: size 

* 

: shrink 

dend : depth - 
left 72 

1 

: size 

* 

: shrink 

dend : depth - 

1 

: size 

* 

: shrink 

right : off angle] 





end 

Michael Barnsley’s leaf is found in [4, p. 330 and Fig. 4.8]. This is part 
of his discussion of methods to concoct an iterated function system that ap- 
proximates a previously given picture. 

Julia sets are named for Gaston Julia. He and Pierre Fatou are both 
credited with creating the theory of iteration in the complex plane at about 
the same time (1918). There was a sometimes bitter priority dispute between 
the two of them about this material. 

The fractal called Pharaoh’s breastplate was described by Mandelbrot in 
[46, p. 126], 

The fractal sets associated with complex number systems are discussed, 
for example, in [13], [29], [30]. The statement of Exercise 1.6.1 was improved 
by a proof submitted by Dan Bernstein, Keith Conrad, and Paul Lefelhocz 
during the Ross summer program. Exercises 1.6.3 and 1.6.4 are stated as 
“either/or’.’ It is possible that both alternatives occur: there is some number 
with more than one expansion, while there is another number with no expan- 
sion. For example, modify the Eisenstein number system (Exercise 1.6.10) by 
using base b = —2, but using only 3 of the digits, say D = {0, 1 , w}. (Thanks 
to Peter Hinow for this example.) 

Here is a hint for Exercise 1.1.2. To see that 1/4 is not an endpoint, prove 
by induction that all endpoints are of the form m/3 k , for nonnegative integers 
m, k. Then use unique factorization to show that 1/4 is not of this form. To 
show that 1/4 £ C, prove by induction on k that 1/4 £ G for all k. In fact, 
it is easier if you prove more: both 1/4 and 3/4 are in Ck for all k. 

Si el le etait douee de vie, 
il ne serait pas possible de I’aneantir, 
sans la supprimer d'emblee 
car el le renaTtrait sans cesse 
des profondeurs de ses triangles, 
comme la vie dans I’univers. 

— E. Cesaro, 1905 

If it [the Koch curve] were given life, 
it would not be possible to destroy it 
except by doing away with it all at once, 
since it would be endlessly reborn 
of the depths of its triangles, 
like life in the universe. 
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Metric Topology 


This chapter contains the mathematical background for much of the rest of 
the book. If the book were organized in strictly logical fashion, then this would 
be the first chapter of the book; but I included instead some more fractal-like 
material as Chap. 1. Chapter 2 is a more technical chapter. Have patience! It 
really is useful for the understanding of the rest of the book. 

Mathematics students will eventually learn almost everything in this chap- 
ter in the normal course of their studies. So many readers may be able to skip 
this chapter completely; but it is here for those who need it. Many of the 
proofs (and exercises) are merely the usual real-number proofs adapted to the 
setting of metric spaces. So a student who has experience dealing with the 
proofs of ordinary calculus will see many familiar ideas. A reader who does 
not care about being mathematically rigorous* could skip the proofs in this 
chapter. Metric topology is, in fact, important for a lot of modern mathemat- 
ics. The selection of topics for this chapter was determined by what is required 
later in the book; so this chapter is a bit peculiar as an introduction to metric 
topology. 


2.1 Metric Space 

A metric space is a set S together with a function g: S x S — > [0,oo) 
satisfying 

g{x,y) = 0 x = y- 

Q{x,y) = g(y, x)\ 
g(x,z) < g(x, y) + g{y,z). 

The last inequality is known as the triangle inequality : in Euclidean geom- 
etry, it says that the sum of the lengths of two sides of a triangle is at least 

* “A simple man believes every word he hears; a clever man understands the need 
for proof.” (Proverbs 14:15, New English Bible) 
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equal to the length of the third side. The nonnegative real number g(x, y) is 
called the distance between x and y. The function g itself is called a metric 
on the set S. A metric space may be written as a pair (S', g), but if the metric 
is understood, it will be referred to simply as S. 


Examples 

Let us consider a few examples of metric spaces. 

Theorem 2.1.1. The set R of real numbers, with the function 

g{x,y) = \x-y\ 


is a metric space. 

Proof. First, note that \x — y\ > 0. Also, \x — y\ = 0 if and only if x = y. Next, 
g( x, y) = \x — y\ = \ — (x — y)\ = \y — x\ = g(y, x). For the triangle inequality, 
let us consider several cases: 

(1) x <y <z: Then g(x, y) + g(y, z) = (y - x) + (z - y) = z - x = g(x, z). 

(2) x < z < y: Then g{x, y) + g(y, z) = (y — x) + (y ~ z) > y — x > z — x = 
g{x,z). 

(3) y <x< z: Then g(x, y)+g(y, z) = (x-y)+(z-y) > z-y > z-x = g(x, z). 

(4) y < z < x: Then g[x, y) + g(y, z) = (x — y) + (z — y) > x — y > x — a = 
g(x,z). 

(5) z < y < x: Then g(x, y) + g(y, z) = {x - y) + (y - z) = x - z = g(x, z). 

(6) z < x < y: Then g(x, y)+g(y, z) = (y—x)+(y—z) > y—z > x—z = g(x, z). 

□ 

Exercise 2.1.2. When is the triangle inequality actually an equality in the 
metric space R? 

If d is a positive integer, then R d is the set of all ordered d-tuples of 
real numbers. We can define several operations in this setting. For x = 
(Xi, x 2 , ■ ■ ■ , Xd) G R d , y = ( 2 / 1 , 2 / 2 , ■ • • , yd) e R d , and s G R, define 

SX = {SX\,SX2, ■ ■ ■ , SXd), 

x + y = {xi + yi,x 2 + yii- ■ • ,x d + yd), 
x-y = x + (-1 )y, 

\x\ = yj Xl + Xl + ---+X 2 d . 

We define d-dimensional Euclidean space to be the set R d with the metric 
g{x,y) = \x- y\. 

In order to show that this is a metric space , I will prove two basic in- 
equalities. 
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Theorem 2.1.3 (Cauchy’s inequality). Let x\, X 2 , ■■■ ,x d , yi, Vi,"' iVd 
be 2d real numbers. Then 


d 

Y 

U =1 


me*? e« 


u =1 


0=1 


Proof. If A is any real number, then 

d 

Y( X J ~ Xy j ) 3 - °' 

i=i 

Multiplying this out and collecting terms, we see 


a 2 - 2 \Y x m a + E*? ^ o. 


ij =i 


o'=i 


o'=i 


This is true for all real numbers A. But in order for a quadratic polynomial 
A A 2 + BA + C to be nonnegative for all A, it is necessary that B 2 — 4 AC < 0. 
In this case, it means 


4 ( Y x m - 4 Ev? E*? - °> 


o=i 


o'=i 


o'=i 


which is equivalent to the inequality to be proved. 


□ 


Theorem 2.1.4 (Minkowski’s inequality). Let x,y £ R d . Then \x + y\ < 

M + \y\- 

Proof. Write x = (xi, x 2 , ■ ■ ■ ,x d ) and y = (j/i, y 2 , ■ ■ • ,Vd )■ Then 


■ + y \ 2 = Y( x i +y rf 


1=1 

d 


— Yj x j + 2 E x i y i + E ■ 


1=1 

d 


1=1 


1=1 

1/2 


1/2 


<E*E 2 IEM E 


1=1 


^ 1=1 

1/2 


0 = 1 


E 

1=1 




W 

= (\x\ + \y\) 2 . 


d 

Y y2 i 

0 = 1 


1/2N 
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By taking the square root of the extremes, we may conclude (since both of 
these terms are nonnegative) 


\x + y\ < |x| + \y\. □ 

Corollary 2.1.5. The space is a metric space with the metric g{x,y) = 

\x-y\- 


Proof. Write x = (xi, x 2 , ■ ■ ■ , x d ) and y = (t/i, y 2 , ■ ■ ■ , yd)- First, 

q(x, y) = \J (yi - Xi) 2 + (y 2 - x 2 ) 2 H \-(y d ~ x d ) 2 > 0. 

If g(x, y) = 0, then (yi — Xi ) 2 + (y2 — X2) 2 H h (y d — x d ) 2 = 0. But a square 

is nonnegative, so this means that all terms must be 0. That is, Xj = yj for 
all j, so that x = y. The equation g(x, y) = g(y , x) is clear. For the triangle 
inequality, we apply Minkowski’s inequality: 

q(x, y) + e(y, z) = \x-y\ + \y- z\ 

>\{x-y) + {y- z)\ = \x- z\ = g{x,z). □ 

Exercise 2.1.6. Find necessary and sufficient conditions for Cauchy’s in- 
equality to be an equality in 

Exercise 2.1.7. Find necessary and sufficient conditions for Minkowski’s in- 
equality to be an equality in 

Next we will consider the set E ^ of infinite strings from the two-letter 
alphabet E = {0,1}. We will define a metric gi/2 for this space. The basic 
idea is that two strings should be considered “close” if they begin in the same 
way. 

So let a and r be two infinite strings. If a = r, then of course the distance 
must be zero: 

Qi/ 2 (ct,<t) = 0. 

If <7 ^ r, then there is a first time they disagree: we can write a = aa 1 , 
r = oct' , where a is a (possibly empty) finite string, and the first character of 
a' is different than the first character of r'. (In the language used before, a is 
the longest common prefix of a and r.) If k is the length of a , then define 

Qi/ 2 {v,t) = 

Proposition 2.1.8. The set is a metric space under the metric g\/2- 

Proof. Clearly 01/2 (o', i") > 0. If a ^ r, then 01/2(0,7) = (l/2) fc > 0. The 
equation 01/2(0,7) = 01/2(7,0) is also clear. 

So all that remains is the triangle inequality. Let a, 7, 9 be three strings. 
I will prove 01/2(0,7) < max{0 1 / 2 (o, 0), 0 1 / 2 (0, 7)}. If two of the strings are 
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equal, then this is clear. So suppose they are all different. Let k be the length 
of the longest common prefix of a and 9 , and let m be the length of the longest 
common prefix of 6 and r. If n = minjfc, to}, we know that the first n letters 
of cr agree with the first n letters of 9; and the first n letters of 9 agree with 
the first n letters of r. Therefore, the first n letters of a agree with the first 
n letters of r. So the longest common prefix of a and r has length at least n. 
Therefore: 


g 1/2 (a,r) <(l/2) n = (1/2)™^^ 

= max{(l/2) fc , (1/2)" 1 } 

= max{p 1/2 (c7, 9), g 1 / 2 (0 1 r)}. 

Finally, this “ultra-triangle” inequality implies the ordinary triangle inequal- 
ity, since 


max{p 1/2 (cr, 0), g 1/2 (0,T)} < gi/ 2 {a,0) + gi/ 2 (9,r). □ 

Related Definitions 

If S' is a metric space with metric g , and T C S, then T is also a metric space 
with metric gr defined by 

gr{x, y) = g{x , y ) for x,y £ T. 

In the future, we will usually write simply g rather than g? . 

The diameter of a subset A of a metric space S is 

diam A = sup { g(x , y) : x, y £ A } . 

The diameter of A is the distance between the two most distant points of A, 
if such points exist. But, for example, if A = [0, 1), the diameter is 1. Even 
though no two points of A have distance exactly 1, there are pairs x,y of 
points of A with distance as close as we like to 1; and there are no pairs x,y 
of points of A with distance greater than 1. 

If A and B are nonempty sets in a metric space S , define the distance 
between them by 


dist(A, B) = inf { g(x, y) : x £ A, y € B } . 

Note that this is not a metric, for example because the triangle inequality 
fails. If A = 0 and B = (0, 1] in M, then dist(^4, B) = 0 even though A / B. 

Let S be a metric space, x £ S, and r > 0. The open ball with center x 
and radius r is the set B r [x ) = { y £ S : g(y,x) < r}. The closed ball with 
center x and radius r is the set B r (x) = {y £ S : g(y,x) < r }. 

Let S be a metric space, and let A be a subset. An interior point of A 
is a point x so that B e (x) C A for some e > 0. A set A is called an open set 
iff every point of A is an interior point. 
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Fig. 2.1.9. (a) 



Proposition 2.1.10. An open ball B r {x) is an open set. 

Proof. Let y £ B r (x). Then g(x,y) < r, so that e = r — g(x,y) is positive. 
The triangle inequality shows that B e (y) C B r {x). So y is an interior point of 
B r (x). □ 

Theorem 2.1.11. Let S be a metric space. Then 0 and S are open sets. If 
U and V are open sets, so is U C \V. IfU is any family of open sets, then the 
union 

U u 

UeU 

is also open. 


Proof. Certainly every point of 0 has whatever property I choose to name, 
such as being an interior point. So 0 is an open set. 

Let x £ S. Then certainly B\{x) C S. So S is an open set. 

Suppose U and V are both open. Let x £ U fl V. Then x is an inte- 
rior point of U, so there is E\ > 0 with B Sl ( x) C U. Also, x is an interior 
point of V, so there is £2 > 0 with B e2 (x) C V. Therefore, if e is the min- 
imum of £1 and £ 2 , then we have B e ( x) C U fl V. So U fl V is an open 
set. 

Let 11 be a family of open sets, and write 

v ={jU. 

UGU 

Let x £ V. Then x £ U for some 1/ G 11. So there is £ > 0 with B e (x ) C U C V. 
Therefore V is an open set. □ 

Let S be a metric space, and let A C S. A point x £ S is an accumulation 
point of A iff, for every e > 0, the ball B e ( x) contains points of A other than 
x. A set A is closed iff it contains all of its accumulation points. Comparing 
this to the definition of “open set” we can easily see that a set A is closed if 
and only if its complement S \ A is open. 
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Exercise 2.1.12. Let S' be a metric space. Then 0 and S are closed sets. If 
A and B are closed sets, so is A U B. If C is any family of closed sets, then 
the intersection 

IV 

Ace 

is also closed. 

Proposition 2.1.13. A closed ball B r (x ) is a closed set. 

Proof. Suppose y g B r {x). Then g(x, y) > r, so that e = g(x, y) — r is positive. 
The triangle inequality shows that B E (y) D B r {x) = 0. Therefore y is not an 
accumulation point of B r (x). This shows that B r (x) is a closed set. □ 

A family 3 of open subsets of a metric space S is called a base for the 
open sets of S iff, for every open set AC. S, and every x £ A, there is U £ 3 
such that x £ U C A. What would be a good definition for a “base for the 
closed sets”? 

Exercise 2.1.14. A family 3 of open subsets of a metric space S' is a base 
for the open sets if and only if every open set T is of the form 

T = U A 

AeA 


for some AC 3. 

Of course the definition of “open set” shows that the collection of all open 
balls is a base for the open sets of S. 

An ultrametric space S is a metric space for which the metric g satisfies 

the ultra-triangle inequality: 

Q{x , z) < max{g(x, y), g{y, z)}. 

Note that in Proposition 2.1.8, we proved that g 1 / 2 ) is an ultrametric 

space. 

The properties of an ultrametric space may seem strange if you are familiar 
only with Euclidean space and its subsets. Here are a few examples to help 
you understand the situation. 

Exercise 2.1.15. Let S be an ultrametric space. Prove: 

(1) Every triangle is isosceles: that is, if x,y,z £ S, then at least two of 
e(x, y), g(y, z), g{x , z) are equal. 

(2) A ball B r (x) of radius r has diameter at most r. 

(3) Every point of a ball is a center: that is, if y £ B r (x), then B r (x) = B r (y). 

(4) A closed ball is an open set. 

(5) An open ball is a closed set. 
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2.2 Metric Structures 


Metric spaces support many of the concepts that are well-known from Eu- 
clidean space. We will discuss functions on metric spaces and sequences in 
metric spaces. 


Functions on Metric Spaces 

Suppose S and T are metric spaces. A function h: S — > T is an isometry iff 

g T {h(x),h{y)) = g s {x,y) 

for all x, y £ S. Two metric spaces are isometric iff there is an isometry of 
one onto the other. A “property” is called a metric property iff it is preserved 
by isometry, that is: if S and T are isometric, and one has the property, then 
so does the other. 

What are the isometries of the Euclidean plane R 2 into itself? Some ex- 
amples are pictured in Fig. 2.2.3. In fact, the maps of these types are the only 
isometries of R 2 . (But, of course, there are infinitely many isometries of each 
of the four types.) Here is an outline of the proof: 

(1) Let ABC and A'B'C' be two congruent triangles in the plane. Any point 
P determines a corresponding point P' such that 

\A — P\ = \A' - P'1, \B-P\= | B’ - P'\, and \C - P\ = \C - P'\. 

(2) Likewise another point Q yields Q' , and \P — Q\ — \P' — Q'\. 

(3) Any two congruent triangles are related by a unique isometry. 

(4) Two given congruent line segments AB, A' B' are related by just two 
isometries: one direct and one opposite. 

(5) Any isometry with an invariant point is a rotation or a reflection. 

(6) An isometry with no invariant point is a translation or a glide-reflection. 

A detailed argument along these lines may be found in [12, Chap. 3]. 

A function h: S — > T is a similarity iff there is a positive number r such 
that 

g(h(x),h(y)) = rg{x,y ) 

for all x,y £ S. (An alternate term is similitude.) The number r is the ratio 
of h. Two metric spaces are similar iff there is a similarity of one onto the 
other. 

Exercise 2.2.1. Let / be the dilation of R 2 with center a and ratio r > 0. 
Then 

1/0*0 - f(y) | = r\x-y\ 


for all x,y € R 2 . 
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Fig. 2.2.3. Isometries of the Plane 


O /> 
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Exercise 2.2.2. Describe all of the similarities of two-dimensional Euclidean 
space R 2 onto itself. 

Let S and T be metric spaces. Let x £ S. A function h: S — > T is con- 
tinuous at x iff, for every e > 0, there is S > 0 such that for all y £ S 

g(x,y)<6 => g(h(x),h(y)) < e. 

The function h is simply called continuous iff it is continuous at every point 
x £ S. This is one of the ten most important definitions in all of mathe- 
matics. A thorough understanding of it will be useful to you not only in the 
study of fractal geometry, but also in much of the other mathematics you will 
study. 

Exercise 2.2.4. Let h: S — > T, let x £ S, and let e, <5 > 0. Then we have 
g{x 1 y)<8 => g(h(x),h(y)) <e 

for all y £ S if and only if 


h[B s (x)] C B e (h(x)). 

Exercise 2.2.5. Isometries and similarities are continuous functions. 

Continuity can be phrased in terms of open sets. 

Theorem 2.2.6. A function h: S — > T is continuous if and only if h~ l \V] is 
open in S for all V open in T. 

Proof. First, suppose that h is continuous. Let V be an open set in T. I must 
show that h~ x \V] is an open set in S. So let x £ /i -1 [E]. Then h(x) £ V, 
which is open, so there is e > 0 with B e (h(x)) C V. By the continuity of h, 
there is S > 0 such that h [B&(x)\ C B e (h(x)) C V. Therefore Bs(x) C /i _1 [E]. 
So h -1 [E] is an open set. 

Conversely, suppose that is open in S whenever V is open in T. 

Let x £ S. I must show that h is continuous at x. Let e > 0. then B e (h( x)) is 
an open set in T. So W = h~ 1 [B e {h{x))] is an open set in S. Now x £ W, so 
there is <5 > 0 with Bg(x ) C W. Therefore h[Bg(x)\ C h[W] C B e (h(x)). So h 
is continuous. □ 

Exercise 2.2.7. Let 3 be a base for the open sets of T. A function h: S — > T 
is continuous if and only if h _1 [E] is open in S for all V £3. 

A function h : S — > T is a homeomorphism of S onto T iff it is bijective, 
and both h and h -1 are continuous. Two metric spaces are homeomorphic 
iff there is a homeomorphism of one onto the other. A “property” is known 
as a topological property iff it is preserved by homeomorphism. 
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Sequences in a Metric Space 

Let S' be a set. A sequence in S is, strictly speaking, a function /: N — > S. It 
is defined by the infinite list of values /( 1), /( 2), /(3), • • • . Often we will write 
something like 

( a '")nGN 

and understand that the function is specified by /( 1 ) = x\, /( 2 ) = x^, and so 
on. We may even write simply (x n ). 

A sequence (x n ) in a metric space S converges to the point x £ S iff 
for every e > 0, there is TV £ N so that g(x n ,x) < £ for all n > N. If this 
happens, we write limn—.^ x n = x , or x n — > x. Also, x is called the limit of 
the sequence (x n ). We say that the sequence is convergent iff it converges 
to some point. 

In order for the definitions to make sense the way they have been worded 
here, we need to know that limits are unique: 

Exercise 2.2.8. Let (x n ) be a sequence in a metric space S. If x n — > a and 
x n — > b , then a = b. 

Many of the definitions of this section have equivalent formulations in 
terms of sequences. 

Theorem 2.2.9. Let S and T be metric spaces, and let h: S — > T be a func- 
tion. Then h is continuous if and only if, for every sequence (x n ) in S, 

x n — > x => h(x n ) — > h(x). 

Proof. First, suppose h is continuous. Let ( x n ) be a sequence in S, and suppose 
x n — > x. I must prove that h(x n ) — > h(x). So let e > 0. Since h is continuous 
at x, there is <5 > 0 with h [S^(x)] C B e {h{x)). Since x n — > x, there is N £ N 
so that x n £ Bs(x) for all n > N. But then h(x n ) £ B e (h(x )) for all n > N. 
This shows that h(x n ) — > h{x). 

For the other direction, I will prove the contrapositive. Suppose h is not 
continuous. I must prove that the convergence property 

x n — > x => h(x n ) — > h{x) 

fails. Since h is not continuous, there exist x £ S and e > 0 such that for all 
8 > 0, there exists y £ S with g(x, y) < 8 but g(h(x), h(y)) > e. In particular, 
for 8 = 1/n, there is x n £ S with g(x n ,x) < 1/n but g(h(x n ), h{x)) > e. This 
means that the sequence (x n ) converges to x, but the image sequence (h(x n )) 
does not converge to h(x). So the convergence property fails. □ 

Exercise 2.2.10. If /: Si — > S 2 is continuous and g: S 2 — > S 3 is continuous, 
then the composition g o f : Si — > S 3 is also continuous. 
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Let (a; n ) ne N be a sequence. Suppose we choose an infinite subset of the 
positive integers, and list them in order: 


k\ < k 2 < < ■ ■ ■ . 

Then we may form a new sequence 

This is called a subsequence of (x n ). 

Let (x n ) be a sequence in a metric space S and let x € S. We say that x 
is a cluster point of the sequence (x n ) iff for every e > 0, and every TV £ N, 
there exists n > TV with g(x n ,x) < e. 

Proposition 2.2.11. The point x is a cluster point of the sequence (x n ) if 
and only if x is the limit of some subsequence of (x n ). 

Proof. Suppose x is a cluster point of (x n ). We will define integers k\ < k 2 < 

■ ■ ■ recursively. Now 1 > 0, so there is n with g{x n ,x) < 1. Let k\ be such 
an n. Then 1/2 > 0, so there is n > k± + 1 with g(x n ,x) < 1/2. Let k 2 
be such an n. Suppose kj has been defined. Then l/(j + 1) > 0, so there is 
n > kj + 1 with g(x n ,x) < l/(j + 1). Let kj + 1 be such an n. So, we get a 
sequence k\ <k 2 < ■ ■ ■ such that g(xk j , x ) < 1 jj for all j. Thus x is the limit 
of the subsequence of (x n ). 

Conversely, suppose x is the limit of the subsequence (a of (x n ). Let 
e > 0. Then there is J £ N so that g(xk d ,x) < e for j > J. If TV £ N, then 
there is kj with both j > J and kj > TV. So we have g(xk j ,x) < e. Therefore 
a; is a cluster point of the sequence (x n ). □ 

Exercise 2.2.12. Let A be a subset of a metric space S. Then the following 
are equivalent: 

(1) A is closed; 

(2) If x G S and there is a sequence (x n ) in A such that x n — > x, then x £ A. 

Exercise 2.2.13. Let A be a subset of a metric space S. Then the following 
are equivalent: 

(1) A is open; 

(2) For every x € A and every sequence (x n ) in S such that x n — > x, there 
exists TV G N so that x n € A for all n > TV. 


Completeness 

A Cauchy sequence in a metric space S' is a sequence (x n ) satisfying: for 
every e > 0 there is TV g N so that g(x n , x m ) < e for all n, m with n> TV and 
m > TV. This is an important definition for the real line R. A sequence in M is 
convergent if and only if it is a Cauchy sequence. For a general metric space, 
only one direction remains true in general: 
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Proposition 2.2.14. Every convergent sequence is a Cauchy sequence. 

Proof. Suppose x n — > x. I will show that (x n ) is a Cauchy sequence. Let e > 0. 
Then also e/2 > 0. Since x n — > x, there is TV € N such that g(x n ,x) < e/2 for 
all n > N. Then, if n, m > N, we have 

£ £ 

g(x n ,x m ) < q(x„,x) + g(x,x m ) < ^ + ~ = £■ 

Therefore (x n ) is a Cauchy sequence. □ 

Consider the metric space S consisting of all the rational numbers. The 
number \/2 is irrational. But if its decimal expansion is truncated after n 
places, the result is a rational number: 

Xi = 1.4 
x 2 = 1.41 
x 3 = 1.414 
x 4 = 1.4142 

and so on. This is a Cauchy sequence in the metric space S that does not 
converge in S. The theorem of Cauchy is so useful, however, that we will 
single out those metric spaces where it is true: A metric space S is called 
complete iff every Cauchy sequence in S converges (in S). 

Exercise 2.2.15. Three-dimensional Euclidean space R 3 is complete. 

Exercise 2.2.16. Suppose S is an ultrametric space. Then a sequence (x n ) 
in S' is a Cauchy sequence if and only if g(x, i,x„+i) — > 0. 

Proposition 2.2.17. The space of infinite strings from the alphabet 

{0,1} is complete under the metric gi/ 2 - 

Proof. Let (cr n ) be a Cauchy sequence in We first define a “candidate” 
r for the limit, then we prove that it is, in fact, the limit. For each k, there is 
rife £ N so that for all n, m > nk, we have 

Qi j n ^ er jyf) 

That means that o nk (A: = a m \k for all m > nk- Define r as follows: The fcth 
letter of r is the kth letter of o nk . So r satisfies 

r\k = o nk f/c 



for all k. 

To see that o n — > r, let e > 0 be given. Choose k so that (l/2) fc < e. Then 
for to > rife, we have a m \k = r\k , so gi/ 2 {(J m ,r) < (1/2) fe < e. This shows 
that ex n — i t . □ 
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Exercise 2.2.18. Completeness is a metric property, but not a topological 
property. 

The closure of a set A is the set A, consisting of A together with all of its 
accumulation points. It is a closed set. A set A is dense in a set B iff A = B. 
It may be useful to verify that this agrees with the definitions given above for 
R (p. 6) and R 2 (p. 10). 

A point x that belongs both to the closure of the set A and to the closure of 
the complementary set S\ A is called a boundary point of A. The boundary 
of A is the set of all boundary points of A. We will write it as dA. 

A set has empty boundary if and only if it is clopen.* The boundary of the 
interval (a, b) in the metric space R is the two-point set {a, b}. The boundary of 
the ball B r (x) in R d is the sphere {j/eR^: \x — y| = r }. In a general metric 
space S, however, we can only say that the boundary of the ball B r {x) is a 
subset of the sphere { y £ S : g(x,y) = r }. For example, if S is ultrametric, 
then the boundary of B r {x) is empty, even if the sphere is not. 

Exercise 2.2.19. If A is any subset of a metric space, then dA is a closed set. 

Suppose T C S. Then T is a metric space in its own right. If A C T, then 
also ACS. The boundary of A is a concept that depends on the complement 
of A, as well as on A itself, so it makes a difference whether the boundary is 
taken in T or in S. For example, S = R contains T = [0, 1]. If A = [0, 1/2], 
then OtA = {1/2}, but d$A = {0, 1/2}. 

Theorem 2.2.20. Let S be a metric space, ACS, and T C S. Then &t(AC I 
T) C d s A. 

Proof. Let x £ dr {A (IT). For every e > 0, there exist points y £ A AT with 
g(y, x) < e and points z £ T \ {A n T) with g(z, x) < e. But points such as y 
are points of A, and points such as z are points of S \ A. So x is a boundary 
point of A in S'. □ 


Contraction Mapping 

A point x is a fixed point of a function / iff f(x) = x. A function / : S — > S 
is a contraction iff there is a constant r < 1 such that 

0(f(x),f(y)) < rg(x,y) 

for all x,y £ S. A contraction is easily seen to be continuous. There is a useful 
theorem on contraction mappings. 

Theorem 2.2.21 (Contraction mapping theorem). A contraction map- 
ping f on a complete nonempty metric space S has a unique fixed point. 

* “Clopen” means “closed and open” 
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Proof. First, there is at most one fixed point. If x and y are both fixed points, 
then g(x, y ) = g(f(x),f(y)) < rg(x , y). But 0 < r < 1, so this is impossible if 
g(x,y) > 0. Therefore g(x,y) = 0, so x = y. 

Now let Xq be any point of S. (Recall that S is nonempty.) Then define 
recursively 

x n +i = f{x n ) for n > 0. 

I claim that (x n ) is a Cauchy sequence. Write a = g(x o,Xi)- It follows by 
induction that g(x n+ i, x n ) < ar n . But then, if m < n, we have 


n— 1 


n— 1 


g( x m ,x n ) < ^2 e( x j+i’ x j) < 


ar m - ar n ar m ( 1 - r n ~ m ) 


1 — r 


1 — r 


< 


ar 

1 — r 


Therefore, if e > 0 is given, choose N large enough that ar N /( 1 — r) < e. 
Then, for n,m> N, we have g(x m ,x n ) < e. 

Now S is complete and (x n ) is a Cauchy sequence, so it converges. Let x be 
the limit. Now / is continuous, so from x n — > x follows also f(x n ) — > f(x). But 
f{x n ) = x n+ 1 , so f(x n ) — > x. Therefore the two limits are equal, x = f(x), so 
a; is a fixed point. □ 


The preceding theorem can be used to prove the existence of certain points 
in a complete metric space. But more than that is true: the proof of the 
theorem shows a way to “construct” the point in question. We record this 
consequence of the proof. 


Corollary 2.2.22. Let f be a contraction mapping on a complete metric space 
S. If Xq is any point of S, and 


x n+l = f(x n ) for n> 0, 


then the sequence x n converges to the fixed point of f. 



Fig. 2.2.21. Illustration for Corollary 2.2.22. 
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A function /: S — > T is a Lipschitz function iff there is a constant £? 
with 

e(f(x),f(y)) < £>(x, y) for all x,y £ S. 

Exercise 2.2.23. Suppose / is a Lipschitz function. Does it follow that / is 
continuous? 

Let A be a nonempty set in a metric space S. Then for x £ 5, the distance 
dist({x}, A) satisfies the Lipschitz condition 

|dist({x},A) — dist({y}, A) | < g(x,y). 

We will usually write dist(x, A) for dist({x}, A). 

A function / : S — > T is inverse Lipschitz iff there is a constant A > 0 
with 

e(f(x),f(y)) > Ag(x,y) for all x,y £ S. 

Exercise 2.2.24. Suppose / is an inverse Lipschitz function. Does it follow 
that / is continuous? 

A function f: S — > T is a lipeomorphism iff it is both Lipschitz and 
inverse Lipschitz. That is, there are positive constants A , B such that 

Ag{x,y) < g(f(x),f(y)) < B g(x,y). 

Such a function / is also called a metric equivalence. 

Exercise 2.2.25. Suppose / is a lipeomorphism. Does it follow that / is a 
homeomorphism? 

Exercise 2.2.26. Find an example of a sequence F n of closed sets in R such 
that U N F n is not closed. 

Exercise 2.2.27. Suppose that, for each n £ Z, we have a closed set F n C 
[n, n + 1]. Then show that |J )igZ F n is closed. 

Separation 

An important consideration for topological dimension will be “separation” of 
sets. Here is our first theorem of this type. 

Theorem 2.2.28. Let A and B be disjoint closed subsets of a metric space 
S. Then there exist disjoint open sets U and V in S with U A A and V D B. 

Proof. Write U = { x £ S : dist(x, A) < dist(x, B)}. By the triangle inequal- 
ity, we have 

| dist(x, A) — dist(y, A)| < g(x,y), 

so that dist(x, A) is a continuous function of x. Similarly, dist(x, B) is a con- 
tinuous function of x. It follows that U is an open set. Since A is closed, we 
have dist(x, A) = 0 if and only if x £ A. So if x £ A, we have dist(x, A) = 0 < 
dist(x, B)\ this shows that A CU. Let V = {x £ S : dist(x, A) > dist(x, B) }. 
As before, V is open and B C V. Clearly U fl V = 0. □ 
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Fig. 2.2.28. Illustration for Theorem 2.2.28 


This result can be rephrased in another form: 

Corollary 2.2.29. Suppose F is closed and U is open. If F QU, then there 
is an open set V with F C V C V C U . 

Exercise 2.2.30. The same idea of proof can be used to prove some variants 
of the theorem: 

(1) Let A and B be subsets of a metric space S. Suppose APiB = 0 = ACiB. 
Then there exist disjoint open sets U and V in S with U A A and V A B. 

(2) Let A and B be disjoint closed subsets of a metric space S. Then there 
exist open sets U and V in S with U A A, V A B 1 and JJ (1 V = 0. 
Rephrase this in the manner of Corollary 2.2.29. 


2.3 Separable and Compact Spaces 

Sometimes it is best to deal not with the most general metric space, but a 
more specialized class of metric spaces. This section deals with two important 
special classes of metric spaces. 


Separable 

A family U of subsets of S is said to cover a set A iff A is contained in the 
union of the family IX. A family which covers a set is known as a cover of 
the set. A cover consisting of a finite number of sets is called a finite cover. 
A cover consisting of a countable number of sets is called a countable cover. 
An open cover of a set A is a cover of A consisting only of open sets. If U is 
a cover of A, then a subcover is a subfamily of U that still covers A. 

Theorem 2.3.1. Let S be a metric space. The following are equivalent: 

(1) There is a countable set D dense in S . [S' is a separable space.] 

(2) There is a countable base for the open sets of S . [the second axiom, of 
countability] 
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(3) Every open cover of S has a countable subcover. [the Lindelof 
property] 

Proof. (1) => (2). Suppose S admits a countable dense set D. Let B = 
{ Bi / n (a) : a £ D,n £ N }. Then B is a countable family of open sets. I claim 
it is a base of the open sets of S. Let U be any open set of S, and let x £ U. 
Then there is e > 0 so that B e { x) C U. Choose n so that 2 /n < e. Since D is 
dense in S, we know that x £ D. So there is a point a £ B i/ n (x) D D. Then 
B\/ n {a) £ B, and we have x £ B 1 / n (a) C B 2 / n (x ) C U. Thus B is a countable 
base for the open sets of S. 

(2) => (3). Suppose there is a countable base B for the open sets of S. 
Let U be an open cover of S. For each point x £ S, choose a set U x £ U 
with x £ U x . Then choose a basic set D x £ B with x £ D x C U x . Now 
{ D x : x £ S } is a subfamily of B, so it is countable. So it has the form 

{D x : i£S} = { D Xn : n £ N } . 

Now write V = { U Xn : n £ N}. This is a countable subfamily of 11. If x £ S, 
then D x = D Xn for some n , and therefore x £ D Xn C U Xn . So V is a countable 
subcover. 

(3) => (1). Suppose that S has the property of Lindelof. For n £ N, the 
collection 

= { B 1/n (x) : x £ S} 

is an open cover of S. Therefore it has a countable subcover, say 
•A-n = { Bi / n (y) ■ y £ Y n } , 
for a countable set Y n . Let 

D = (J Yn. 

nGN 

Then D is countable, since it is a countable union of countable sets. If x £ S 
is any point, and e > 0, choose n with 1 jn < e. Since A n is a cover of S, there 
is y £Y n C D with x £ Bi/ n {y). Therefore y £ Bi/ n (x) C B e ( x). This shows 
that D is dense in S. □ 

A metric space S will be called separable iff it has one (and therefore all) 
of the properties of Theorem 2.3.1. A subset of a metric space will be called 
separable iff it is a separable metric space when considered to be a metric 
space in its own right. Separable metric spaces have many useful properties. 
By (2), any subset of a separable metric space is separable. By (1), a union 
of countably many separable sets is separable. All of the examples of metric 
spaces in this book are separable: 

Exercise 2.3.2. The set 

{ (xi,x 2 , ■ ■ ■ , Xd ) £ : all x ? are rational numbers } 

is countable and dense in Euclidean space R d . 
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Exercise 2.3.3. Consider the space of infinite strings from the alphabet 
{0, 1} under the metric gi/ 2 . The set 



is a countable base for the open sets. In fact, every open ball is one of the 
sets [a]. 

Compact 

We will begin by considering compactness of a closed interval [a, b] in R. 

Theorem 2.3.4 (The Bolzano— Weierstrass theorem). Let a < b be real 
numbers. If (x n ) is any sequence in the interval [a, b], then ( x n ) has at least 
one cluster point. 

Proof. We will define recursively a sequence (/*,) of closed intervals, such that 
each Jfc contains x n for infinitely many n. Let /o = [1,6]. Then Iq contains x n 
for all n. If Ik has been defined, say Ik = [cifc,6fc], then we will consider the 
midpoint Ck = (a*, + bk)/ 2. Since Ik contains x n for infinitely many n, either 
the left half [ak,c k \ or the right half [ck,bk] also contains x n for infinitely 
many n.Let I k +i be a half that contains x n for infinitely many n. Now also 
note from the definition that the length of Ik+i is half the length of Ik, so the 
length of Ik is 2 ~ k (b— a). This converges to 0. Also note that I k +i C I k . This 
means that a m G Ik for all m > k. So (a k ) is a Cauchy sequence, and hence a 
convergent sequence. Let x be its limit. The interval I k is closed, so the limit 
x is in I k . I claim that £ is a cluster point of ( x n ). If e > 0, choose k so that 
the length of I k is less than e. Now x n is in I k for infinitely many n, so if N 
is given, then there is n > iV with x n £ I k - Also x £ I k , so \x n — x\ < e. Thus 
(x n ) has a cluster point. □ 

A metric space S is called sequentially compact iff every sequence in S 
has at least one cluster point (in S). 

Let r > 0. A subset A of a metric space S is an r-net for S iff every 
point of S is within distance at most r of some element of A. For example, 
the countable set { rn : n £ Z } is an r-net in R. 

Proposition 2.3.5. Let S be a sequentially compact metric space, and let 
r > 0. Then S has a finite r-net. 

Proof. Suppose S has no finite r-net. 

We will define a sequence (x n ) recursively, with g(x n ,x m ) > r for all 
m ^ n. First, S ^ 0 (since 0 is a finite r-net in 0). So we may choose 
x\ £ S. Now, assume x\, x 2 , ■ ■ ■ , x n have been chosen. Since {xi, x 2 , • • • , x n } 
is not an r-net, there exists a point (call it x n +i) such that g(xj,x n+ 1 ) > r 
for 1 < j < n. This completes the definition of the sequence (x n ). 
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Now I claim that this sequence (x n ) has no cluster point. If x were a 
cluster point, then the ball B r / 2 (x) would contain at least two of the points 
x n , which is impossible since they have distance exceeding r. Therefore S is 
not sequentially compact. □ 

Corollary 2.3.6. A sequentially compact metric space is separable. 

Proof. Suppose S is sequentially compact. For each n let D n be a finite 1/n- 
net for S. Then D = (J ngN D n is a countable set dense in S. □ 

Proposition 2.3.7. Let a < b be real numbers. If A is any infinite subset of 
the interval [a, b], then A has at least one accumulation point. 

Proof. If A is an infinite set, we may choose a sequence x n £ A of distinct 
elements. By Theorem 2.3.4, (x n ) has a cluster point x. If e > 0, then x n £ 
B e {x) for infinitely many n, so x n £ B e (x) for some x n ^ x. This shows that 
x is an accumulation point of A. □ 

A metric space S is called countably compact iff every infinite subset of 
S has at least one accumulation point (in S). Let J be a family of subsets 
of a set S. We say that T has the finite intersection property iff any 
intersection of finitely many sets from T is nonempty. 

Theorem 2.3.8 (The Heine Borel theorem). Let a < b be real numbers. 
Let T be a family of closed subsets of the interval [a, b}. If T has the finite 
intersection property, then the intersection 

rw 

Fes' 


of the entire family is not empty. 

Proof. First, by Exercise 2.3.2, the line 1R is separable; therefore also [a, b\ is 
separable. Suppose (for purposes of contradiction), that 

f|F = 0. (1) 

Fe? 

That means that { [a, 6] \ F : F £ T } is an open cover of [a, b\. So by the 
Lindelof property (Theorem 2.3.1), there is a countable subcover. That means 
that there is a countable number of the sets of T with empty intersection. Say 

fl F n = 0, (2) 

raeN 


where F n £ T. Now, for each n, the finite intersection Fi fl F% fl • • • fl F n is 
nonempty; choose an element x n . By sequentialcompactness (Theorem 2.3.4), 
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the sequence x n has a cluster point, say x. Since F n is closed and x m £ F n 
for all m > n, we have x £ F n . This is true for all n, so 


x £ P| F n , 


which contradicts (2). This contradiction arose from assuming (1). Therefore 



□ 




A metric space S is (temporarily) called bicompact iff every family of 
closed sets with the finite intersection property has nonempty intersection. As 
in the proof above, simply taking complements shows that this is equivalent 
to saying that every open cover of S has a finite subcover. 

Theorem 2.3.9. Let S be a metric space. The following are equivalent: 

(1) S is sequentially compact, 

(2) S is countably compact, 

(3) S is bicompact. 

Proof. (3) => (2). Suppose S is not countably compact. Then there is an 
infinite subset A of S' with no accumulation points. For each point x £ S, 
choose an open ball B x such that B x contains no points of A (except possibly x 
itself) . Then U = {B x : x £ S } is an open cover of S. Any finite subcollection 
of U contains only finitely many points of A, so II does not admit a finite 
subcover. So S is not bicompact. 

(2) ==> (1). Suppose S is countably compact. Let (x n ) be any sequence 
in S. If there is a point x with x n = x for infinitely many n, then that x 
is a cluster point of the sequence ( x n ). On the other hand, if there is no 
such point, then the set A = {x n : n € N } of values is an infinite set; so A 
has an accumulation point, which is easily seen to be a cluster point of the 
sequence (x n ). So in all cases (x n ) has a cluster point. Thus S is sequentially 
compact . 

(1) => (3). Suppose S is sequentially compact. Then by Corollary 2.3.6, 
S is separable. The rest of the proof follows the proof of Theorem 2.3.8 word 
for word. □ 

A metric space S will be called compact iff it has one (and therefore all) 
of the properties of Theorem 2.3.9. A subset of a metric space will be called 
compact iff it is a compact metric space when considered to be a metric space 
in its own right. One of the most useful ways to prove that a set is compact 
is the following: 


Proposition 2.3.10. A closed subset of a compact space is compact. 
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Proof. Suppose S is compact and T C S is closed. Let (x n ) be a sequence 
in T. Now by the compactness of S , there is a : £ S which is the limit of 
a subsequence ( Xk t ) of (x n ). But T is closed and Xk t € T for all i, so the 
limit x is also in T. Thus (x n ) has a cluster point in T. This shows that T is 
compact . □ 

Proposition 2.3.11. Let A C R d . Then A is compact if and only if A is 
closed and bounded. 

Proof. First, suppose A is closed and bounded. Then A is a subset of a large 
cube, 

C = { x = (xi, X 2 , ■ ■ ■ , Xd) '■ — a < Xj < a for all j } . 

By Proposition 2.3.10, if I show that C is compact, it will follow that A is 
compact. Now let ( y n ) be a sequence in C . For notation, write 

Un ( Unh Vn 2 5 5 Und)- 

Now the sequence of first coordinates (y n i)n£N is a sequence in [—a, a], which 
is compact. So there is a subsequence that converges, so there is an infinite 
set Ni = {nx < n 2 < • ■ • } and 

lim y n 1 = zi. 

raGiVi 

Next, the sequence of second coordinates ( y n 2 )neN 1 is a sequence in [—a, a], 
which is compact. So there is a subsequence that converges, say N 2 C iVi and 

lim y n2 = z 2 - 
neN 2 

Similarly, we get N 3 D ■ ■ ■ D Nd , with 

iim y nj = zj. 

n(zNj 

Finally, the subsequence (y n )n£N d has all coordinates convergent, and its limit 
is z = (zi, z 2l • • • ,Zd) £ C. This proves that C is compact. 

Conversely, suppose that A is compact. If A is unbounded, then 

{B„(0)nA: n£N} 

is an open cover of A with no finite subcover. If A is not closed, there is an 
accumulation point x of A that is not in A. So there is a sequence (x n ) in A 
converging to x. This sequence has no cluster point in A. □ 

Exercise 2.3.12. The metric space of all infinite strings constructed 

using the alphabet E = {0, 1} is compact under the metric Qi/ 2 . 

Of course compactness is a topological property: If S and T are homeo- 
morphic, and S is compact, then T is compact. 

Exercise 2.3.13. A compact subset of a metric space is closed. 

Exercise 2.3.14. The union of finitely many compact sets is compact. 
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Image and Inverse Image 

If / : S — > T is a continuous function, and ACS, some properties of the set 
A are related to properties of the image set 

f[A\ = {/( x) : x G A} . 

Theorem 2.3.15. Let f : S — > T be continuous. Let AC S be compact. Then 
f[A] is compact. 

Proof. Let (y n ) be a sequence in f[A], Then there exist points x n £ A with 
/( x n ) = y n . By the compactness of A, there is a subsequence (x^) that 
converges to some point x £ A. But since / in continuous, this implies that 
Uki = }{x ki ) -> f(x). Now f(x) S f[A], so (y n ) has a cluster point in f[A). 
This shows that f[A\ is compact. □ 

Corollary 2.3.16. Let S be a compact metric space, and let the function 
f : S — > R be continuous. Then f is bounded; that is: there is B £ K such 
that |/(x)| < B for all x £ S. 

Proof. By Theorem 2.3.15, f[S) is a compact subset of R. By Proposi- 
tion 2.3.11, it is a bounded set. □ 

Let / : S — > T be continuous. Properties of a set B CT may be related to 
properties of the inverse image 

f- 1 [B\ = {xeS: f(x) SB}. 

Proposition 2.3.17. Let f:S-^>T be continuous. Consider a set B C T. 
Then: 

(1) If B is open, then f _1 [B] is open. 

(2) If B is closed, then f _1 [B] is closed. 

Proof. (1) follows from Theorem 2.2.6. (2) follows from (1) by taking comple- 
ments. □ 

Recall that dist(T, B) = 0 can happen if A ^ B. This happens, for exam- 
ple, if AnB ^ 0. Or even if A and B have an accumulation point in common. 
But it can happen in other ways too: 

Exercise 2.3.18. Give an example of two disjoint nonempty closed sets with 
dist(T, B) = 0. 

If one of the sets is compact, then we do get positive distance. 

Theorem 2.3.19. If A is closed , B is compact, and AnB = 0, then 
dist(T, B) > 0. 

Proof. Suppose dist(T, B) = 0. Then there exist points x n £ A and y n £ B 
with g(x n ,y n ) < 1/n. Now B is compact, so (by replacing the sequences with 
subsequences) we may assume that (y n ) converges. Say y n — > y £ B. Then 
x n -s- y, also. But A is closed, so y £ A. Therefore 4flB/0. □ 

For A C S and x £ S, recall that we write dist(x,T) = dist({x}, A). Now 
{x} is a compact set, so we have x € A if and only if dist(x, A) = 0. 
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Uniform Continuity 

Let f : S — > T be a function. Let us recall a definition: / is continuous iff, for 
every x £ S and every e > 0, there exists S > 0 so that f[Bg(x)\ C B e (f(x)). 
Compare that to this definition: / is uniformly continuous iff, for every e > 
0, there exists S > 0 so that for every x £ S, we have f[Bg(x)] C B e (/(x)). 
What is the difference: in the first case, (5 is allowed to depend not only on s, 
but also on x: but in the second case, while S still depends on e, it does not 
depend on x. 

Clearly every uniformly continuous function is continuous. But the con- 
verse may fail: 

Exercise 2.3.20. Let /: R — > R be defined by f(x) = x 2 . Then / is continu- 
ous, but not uniformly continuous. 

However, when S is compact, then the two kinds of continuity are the 
same: 

Theorem 2.3.21. Let S be a compact metric space, let T be a metric space, 
and let f : S — > T be a function. If f is continuous, then f is uniformly 
continuous. 

Proof. Let e > 0 be given. Then by continuity, for every x £ S, there is a 
positive number 8(x) such that f[Bg^(x)] C B E / 2 (f(x)). Now the collection 

{ B &(x)/i(x) : ifS} 

is an open cover of S. By compactness, there is a finite subcover, say 
(*£l) j Bg( X2 )i 2 (x 2 ) , , Bg^ Xn ^j 2 (x n ) . 

Let 6 = min{<5(xi)/2, S(x 2 )/2, ■ ■ ■ ,S(x n )/2}. I claim that for this S, and any 
x £ S, we have f[Bg(x)] C H e (/(x)). Indeed, suppose y £ Bg(x). For some i, 
we have y £ B S ( x .y 2 (xi). Therefore g(y, xf) < <5(xi)/2, so e(f(y),f{xi)) < e/2. 
But also g{x,Xi) < g{x,y) + g(y,x *) < 8(xi), so g(f(x),f(xi)) < e/2. So 
we have g(f(x),f(y)) < e. That is, we have f[Bg(x)] C H e (/(x)), so / is 
uniformly continuous. □ 

Theorem 2.3.22. Let S be a compact metric space, and let 11 be an open 
cover of S . Then there is a positive number r such that for any set ACS 
with diamH < r, there is a set U £ U with ACU. 

Proof. First, there is a finite subcover, say {Ui ,U 2 , - ■ ■ ,U n }. Suppose the 
assertion is false. Then for every k £ N, there is a set A *. of diameter at 
most 1/k that is not contained in any 11,,. So there are points Xik £ Ak \ Ui, 
(1 < i < n, k £ N). By taking subsequences, we may assume that limj, x^ = Xi 
exists for each i. But the points Xi have distance 0 from each other, so they 
are all equal. This point Xi does not belong to any set Ui, contradicting the 
fact that the sets Ui cover S. □ 
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The largest number r with the property of the Theorem is known as the 
Lebesgue number of the cover II. 


Number Systems 

Recall the situation from Sect. 1.6. Let & be a complex number, \b\ > 1, and 
let D be a finite set of complex numbers, including 0. We are interested in 
representing complex numbers in the number system they define. 

Write F for the set of “fractions” ; that is numbers of the form 

OO 

5>,^> a j £ D. 

1=1 


Proposition 2.3.23. The set F of fractions is a compact set. 

Proof. Let A C F be infinite. For each digit d £ D, write 

,oi = d I 

Then we have A = Udeu A(d)\ s i nce A is infinite, at least one of the sets A(d) 
is also infinite. Choose d\ £ D so that A(d{) is infinite. Then write 


A(d) = 


£ A: x = 


OO 

x = J2 a i 

l=i 


A(d 1} d) 


£ A: x = 


£■ 

1=1 


jb 3 , a\ = di, 02 = d 


There is d 2 £ D so that A(d\,d 2 ) is infinite. We may continue in this way: we 
obtain a sequence (dj) of digits so that A(di,d 2 , • • • ,dk) is infinite for all k. 
Then the number 

OO 

£<*!&-* 

l=i 

is an accumulation point of the set A. This shows that F is compact. □ 


2.4 Uniform Convergence 

In Chap. 1 I had several occasions to mention “convergence” of a sequence 
of sets. There are two ways that will be used to make that idea precise. The 
first way, in this section, is “uniform convergence” of functions. The second 
way, in Sect. 2.5, is the metric of Hausdorff. In both cases, the most natural 
setting is that of an appropriate metric space. This is one of the reasons that 
we have been considering abstract metric spaces, rather than just subsets of 
Euclidean space. 
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Uniform Convergence in General 

Let S and T be two metric spaces. We will be considering functions / : S — * T. 

Let f n be a sequence of functions from S to T, and let / be another 
function from S to T . The sequence f n converges uniformly (on S) to the 
function / iff for every e > 0, there exists N £ N so that for all x £ S and all 
n > N, we have g(fn(x), f(x)) < e. 

This definition may be rephrased as convergence for a metric. We will say 
that two functions / and g are within uniform distance r of each other iff 

P(fi x )i9( x )) < r for all x £ S. 

Then we write g n (f,g) < r. More precisely, g u (f,g) is the smallest number r 
that works: 

Qu(f,g) = svp{g(f(x),g(x)): xgS}. 

Uniform convergence makes sense even if the functions are not continuous. 
Here is the most important property of uniform convergence: 

Theorem 2.4.1. Suppose /„: S — > T is a sequence of functions from the 
metric space S to the metric space T. Suppose f n converges uniformly to a 
function f. If all functions f n are continuous, then f is continuous. 

Proof. Let x £ S be given. I must show that / is continuous at x. So let 
a positive number e be given. I must find a corresponding value 5. Now /„ 
converges to / uniformly, and e/3 is a positive number, so there exists N £ N 
such that for all n> N, and all y £ S, we have g(f(y),f n (y)) < e/3. Once N 
is known, we can use the fact that the single function fjy is continuous at x to 
conclude that there exists S > 0 so that g(y, x) < 6 implies g(fN(x), /iv(y)) < 
e/3. But then, for any y with g(x,y) < 6, we have 

Q(f(x),f(y)) < g{f{x)jN{ x )) + g(fN{ x )jN{y)) + g(fN{y),f(y)) 



This shows that / is continuous at x. □ 

If the set of functions we deal with is properly restricted, then g u will be a 
metric. This is useful, since then all of the work we have done on metric spaces 
will be relevant. If S and T are metric spaces, we will write 6 (S,T) for the 
set of all continuous functions from S to T. 

Theorem 2.4.2. Let S be a compact metric space, and let T be a metric 
space. Then g u is a metric on 6(5, T). 

Proof. First, g n (f, g) > 0 and g u (f, g) = g u (g, f) are clear. Since 5 is compact, 
we have g u (f,g ) < oo, as in Corollary 2.3.16. If / = g, then g u {f,g ) = 0. If 
g u (f,g) = 0, then 0 < g(f(x),g(x)) < 0 for all x £ S, so f(x) = g(x) for all 
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x £ S, which means / = g. Finally, I must prove the triangle inequality. For 
any x £ S, we have 

Q{f{x),h(x)) < g(f(x),g(x)) + g(g(x),h{ x)) < g u (f,g) + e u (g,h). 

Therefore / and h are within uniform distance g u (f, g) + Qu(g, h) of each other, 
so g u (f,h) < g u (f,g) + g u {g,h). □ 

The metric g u will be called the uniform metric. 

It makes sense to ask about the metric properties of the metric space 
C (5, T). Completeness will be a very useful property. 

Theorem 2.4.3. Suppose S is a compact metric space , and T is a complete 
metric space. Then the metric space G(S,T) is complete. 

Proof. Let (f n )neN be a Cauchy sequence in 6(5, T). Let x £ S. Then we 
have 

Q{fn(x), fm(x)) < Qu(fn,fm), 

so (/ n (i))„g n is a Cauchy sequence in T. Since T is complete, this Cauchy 
sequence converges. Call its limit f{x). This construction is valid for each 
x £ S, so it defines a function f: S — > T. I must show that f n converges 
uniformly to /. Let e > 0 be given. There is N £ N so that g u {fn , fm) < e/2 
for all n, m > N. For any x £ S, there is N x £ N so that g(f n 0),/ 0)) < e/2 
for n > N x . We may assume N x > N. Therefore, for any n > N, and any x, 
we have 


e(fn(x),f(x)) < e(fn(x), f Nx (x)) + q{/n x (x), f(x)) < £. 

Therefore g u {fn, f) < e. This proves that f n converges uniformly to /. By 
Theorem 2.4.1, the limit / is continuous, / £ C (5, T). So the space G(S,T) is 
complete under the uniform metric g u . □ 

Exercise 2.4.4. Under what conditions on S and T is the metric space 
C (S,T) ultrametric? 

Exercise 2.4.5. Under what conditions on S and T is the metric space 
C (5, T) compact? 

Continuous Curves 

Suppose the metric space T is a convex subset of a Euclidean space R k , and 
S' is a Euclidean space K d . A function /: T — > S is affine iff it satisfies 

f{tx + (1 - t)y) = tf(x) + (1 - t)f(y) 

whenever x,y £ T and 0 < t < 1. 
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Proposition 2.4.6. An affine map f of an interval [u, u] into R. must be of 
the form /( x) = mx + b, for some 

Proof. If u = v, then of course f(x) = f(u) for all x £ [u, i;]. So suppose 
u < v. Now if it £ \u, u], then 


v — x x — u 

x = u H v, 

v — u v — u 


which is of the form tu + (1 — t)v with 0 < t < 1. So 


f(x) 


v — X 


v — u 


f(u) + 


x — u 


v — u 


f(v) 


f(v) - f(u) __ , vf(u) - uf(v ) 

X I 5 

v — u v — u 


which has the required form. □ 

Exercise 2.4.7. An affine map / of an interval [it, i;] into R d must be of the 
form f(x) = xm + b , for some m, b £ R d . 

A continuous curve in a metric space S' is a continuous function 
/: [0,1] — ► S. (Illustration in Fig. 2.4.8.) Sometimes we will use the term 
to refer to the range /[[0, 1]] of such a function. 

A continuous curve / : [0, 1] — > is piecewise affine iff there exist 
“division points” 0 = Oo < ai < ■ • • < a n = 1, such that / is affine on each of 
the subintervals [a ; _i , a 3 \. The range of such a function is called a polygonal 
curve. 

Consider the dragon construction of the Koch curve (p. 18). There is a 
sequence (Pk) of polygonal curves. This sequence is supposed to “converge” 
to the Koch curve P. Since we do not have Pk+i Q Pk, it is not reasonable to 
take the intersection. Here is one way to produce a reasonable limit. 

The curve Pk consists of 4 fc line segments. Divide the interval [0, 1] into 
4 fc subintervals of equal length. Define a function gk : [0, 1] — > K 2 , so that it 



Fig. 2.4.8. Continuous curve 
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is affine on each of these subintervals, and continuous on all of [0, 1], so that 
the subintervals are mapped to the line segments of Pk . (To be explicit, start 
with gk(0) at the left end of Pk-) Thus we have gk [[0, 1]] = Pk- 

Proposition 2.4.10. The sequence (gk), in the dragon construction of the 
Koch curve, converges uniformly. 

Proof. First note that every point of Pi is within distance 1 of every point of 
Pq. So certainly g u (go,gi) < 1- Now we estimate g u (gk,gk+ 1 ). Let t G [0,1]. 
Then t is in one of the 4 fc subintervals J on which gk is affine. Its image gk[J] 
is a line segment of length 3 -fe . When gk+i is constructed, the interval J is 
subdivided into 4 equal intervals, and a miniature copy of g\ is used there 
(properly translated, rotated, and dilated by 3~ fe ). So g(gk(t), gk+i(t)) < 
3~ k g(go,gi). Therefore g u (g k ,gk+ i) < 3 -fc . 

But then for m > k we have 

m— 1 m — 1 

gu(gk, g-m) < y ' Qu(gj , 9 j+i) y ] 3 ^ < 3 ~ l ~ / 2 . 

j=k j=k 

This shows that (gk) is a Cauchy sequence for the uniform metric, so it con- 
verges uniformly by Theorem 2.4.3. □ 

Write g(t) = lim^oo gk(t) for all t £ [0, 1]. By Theorem 2.4.1, the function g 
is continuous. The limit set, the Koch curve, is then P = c/[[0, 1]] . This also 
explains the use of the word “curve’.’ 

Exercise 2.4.11. Provide similar description for Heighway’s dragon. Be sure 
to prove the uniform convergence that is required. 

Now that we have a rigorous definition for the “convergence” that is in- 
volved in the definitions, we may try to prove some of the properties of the 
dragon curves. 

Is there a uniform convergence construction for the limit of the sequence 
(Lk) on p. 5? Not in the same sense as for the dragons. The limit set (which 


70 2 Metric Topology 

is presumably the Cantor dust) is not a continuous curve. (We will prove this 
later, Proposition 3.1.6.) But there is a similar sort of uniform convergence 
construction. We must replace the interval [0, 1] with a different parameter 
space. 

Consider the space of infinite strings from the alphabet {0,1}. Define 
functions g k : -» R as follows. Begin with go(o') = 0 for all a. Then 

go [£?(">] = {0} = L 0 . Next, define gi(0cr) = 0 and < 7 i(lcr) = 2/3. Then 
gi[E^ u >] = {0,2/3} = L\. The function g\ is continuous since the sets [0] 
and [1] are open sets. We may continue recursively: If g k has been defined, 
then 


/ n x 9k(cr) 

gk+ i ( 0 < t ) = — 

n ^ g k (a) + 2 

g k +i{lcr) = 3 ■ 

The remainder is left to the reader: 

Exercise 2.4.14. These functions g^ : E^ ' 1 —* R are continuous and satisfy 
< 7 fc [.£?(“)] = L k . The sequence ( g k ) converges uniformly. 

Exercise 2.4.15. Show that the limit function g = lirngj, is the addressing 
function for the Cantor dust described on p. 14. So g[E ^ ] is exactly the 
triadic Cantor dust. 




Fig. 2.4.12. Construction for Peano curve 



Fig. 2.4.13. Peano polygons 
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Space-Filling Curves 

A continuous curve /: [0,1] — > R d , where d > 2 is called a space-filling 
curve iff / [[0, 1]] contains a ball B r (x). This possibility was realized by 
Peano, so it might be called a Peano curve. (We will see later that Heighway’s 
dragon is a space-filling curve, Proposition 2.5.9.) Here is an easy example of 
a Peano curve. It can be done as a dragon curve. To go from one stage to the 
next, each line segment is replaced by nine segments with 1/3 the length, as 
in Fig. 2.4.12. Figure 2.4.13 shows a few stages, as usual. 

Exercise 2.4.16. Prove that the limit curve is space filling. 


2.5 The Hausdorff Metric 

Felix Hausdorff devised a way to describe convergence of sets. It is (for some 
purposes) better than the one discussed in the previous section, since it does 
not require finding an appropriate parameter space and parameterizations for 
the sets. It is simply a definition of a metric that applies to sets. 


Convergence of Sets 

Let S' be a metric space. Let A and B be subsets of S. We say that A and B 
are within Hausdorff distance r of each other iff every point of A is within 
distance r of some point of B , and every point of B is within distance r of 
some point of A. 

This idea can be made into a metric, called the Hausdorff metric , D. If 
A is a set and r > 0, then the open r -neighborhood of A is 

N r (A ) = { y : g(x, y) < r for some x £ A} . 

The definition of the Hausdorff metric D: 

D(A , B) = inf { r > 0 : AC N r (B) and B C N r (A) } . 

By convention, inf 0 = oo. 

This definition D does not define a metric, however. There are various 
problems. For example, in M, what is the distance between {0} and [0,oo)7 
It is infinite. That is not allowed in the definition of metric. Therefore, we 
will restrict the use of D to bounded sets. What is the distance £>(0,{O})? 
Again, infinite. So we will restrict the use of D to nonempty sets. What is the 
distance Z?(( 0, 1), [0, 1])? Now the distance is 0, even though the two sets are 
not equal. Therefore we will restrict the use of D to closed sets. In fact for 
the purposes of this book, we will apply D only to nonempty compact sets. 

If S’ is a metric space, we will write H(5) for the collection of all nonempty 
compact subsets of S. This is called the hyperspace for S. 
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Theorem 2.5.1. Let S be a metric space. The Hausdorff function D is a 
metric on the set H(>S). 

Proof. First, clearly D(A,B) > 0 and D(A 1 B) = D(B,A). Since A and B 
are compact, they are bounded, so D(A,B) < oo. 

If A = B, then for every e > 0 we have A C N e (B)\ therefore D{A , B ) = 0. 
Conversely, suppose H, S £ H(S') satisfy D{A, B) = 0. If x £ A, then for every 
e > 0, we have x £ N e (B ), so dist(x, B) = 0. Now B is compact, hence closed, 
so x £ B. This shows A C B. Similarly B C A, so A = B. 

Finally we have the triangle inequality. Let A,B,C £ H(5). Let e > 0. 
If x £ A, then there is y £ B with g(x,y) < D(A,B) + e. Then there is 
z £ C with g(y, z ) < D(B , C) + e. This shows that A is contained in the 
( D(A , B) + D(B, C) + 2e)-neighborhood of C. Similarly, C is contained in the 
( D(A , B) + D(B , C) + 2£r)-neighborhood of A. Therefore D(A, C) < D(A, B) + 
D(B , C) + 2s. This is true for all e > 0, so D(A, C ) < D(A 1 B) +D(B, C). □ 

Here is one way to describe the limit. 

Exercise 2.5.2. Let A n be a sequence of nonempty compact subsets of S 
and let A be a nonempty compact subset of S. If A n converges to A in the 
Hausdorff metric, then 

A = { x : there is a sequence (x n ) with x n £ A n and x n — > x } . 

We may ask about the metric properties of the metric space H(S). The 
most important one will be completeness. 

Theorem 2.5.3. Suppose S is a complete metric space. Then the space 01(5) 
is complete. 

Proof. Suppose (A n ) is a Cauchy sequence in H(S'). I must show that A n con- 
verges. Let A = { x : there is a sequence (xk) with Xk £ Ak and Xk — > x }. I 
must show that D(A n ,A) — > 0 and A is nonempty and compact. 

Let e > 0 be given. Then there is N £ N so that n,m > N implies 
D(A n , A m ) < e/2. Let n > N. I claim that D(A n ,A) < e. 

If x £ A, then there is a sequence (xk) with Xk £ Ak and Xk — » x. So, 
for large enough k, we have g(xk,x) < e/2. Thus, if k > N, then (since 
D(Ak,A n ) < e/2) there is y £ A n with g(xk,y) < e/2, and we have g{y,x) < 
g{y,Xk ) + g(xk,x) < e. This shows that A C N e (A n ). 

Now suppose y £ A n . Choose integers k\ < k -2 < ■ ■ ■ so that ki = n and 
D(Ak j , A m ) < 2-Je for all m>kj. Then define a sequence (y*,) with y*. £ A k 
as follows: For k < n, choose yk £ Ak arbitrarily. Choose y n = y. If yk . has 
been chosen, and kj < k < kj+i, choose yk £ Ak with g{yk :j ■ Vk) < 2~ J e. Then 
yk is a Cauchy sequence, so it converges. Let x be its limit. So x £ A. We 
have g(y,x) = lim*, g{y,yk) < e. So y £ N E (A). This shows that A n C N e (A). 
Note that, taking e = 1 in this argument, I have also proved that d/0. 

So we have D(A,A n ) < e. This concludes the proof that (A n ) converges 
to A. 
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Next I show that A is “totally bounded”: that is, for every e > 0, there is 
a finite e-net in A. Choose n so that D(A n ,A) < e/3. By Proposition 2.3.5, 
there is a finite (e/3)-net for A n , say {j/i , J/ 2 , ■ ■ ■ , y m }- Now for each y*, there 
is Xi £ A with g(xi,yi) < e/3. The finite set {x\,X 2 ,--- ,x m } is an e-net 
for A. 

Now I will show that A is a closed subset of S. Let x belong to the closure 
A of A. Then there exists a sequence (y„) in A with g(x, y n ) < 2~ n . For each 
n there is a point z n £ A n with g{z n ,y n ) < D(A n ,A) + 2~ n . Now 

g(z n ,x) < g{Zn,Vn) + g(y n ,x) < D(A n , A) + 2“" + 2 _ ”. 

This converges to 0, so z n — * x. Thus x £ A. This shows that A is closed. 

Finally, to show that A is compact, I will show that it is countably compact. 
Let F be an infinite subset of A. There is a finite (l/2)-net B for A, so each 
element of F is within distance 1/2 of some element of B. Now F is infinite 
and B is finite, so there is an element of B within distance 1/2 of infinitely 
many elements of F. Let F\ C F be that infinite subset. The points of F\ are 
all within distance 1 of each other; that is, diam.Fi < 1. In the same way, there 
is an infinite set Fi C F\ with diarn Fi < 1/2; and so on. There are infinite 
sets Fj with diam F ? < 2 _J and F J+1 C F :/ for all j. Now if Xj is chosen from 
Fj, we have g(xj,Xf.) < 2~i if j < k, so (xj) is a Cauchy sequence. Since S is 
complete, (xj) converges, say Xj — > x. Since A is closed, x £ A. But then x is 
a cluster point of the set F. Therefore A is compact. □ 

Exercise 2.5.4. Under what conditions on S is H(S') compact? 

Exercise 2.5.5. Under what conditions on S is H(5) ultrametric? 

Convergence in the Examples 

Whenever we have used the idea of “convergence” for a sequence of sets in 
a metric space, we have been talking about nonempty compact sets. The 
Hausdorff metric is the proper way to interpret it. In fact, it agrees with the 
other interpretations that have been used. 

Proposition 2.5.6. Let A n be a sequence of nonempty compact sets, and 
suppose they decrease: Ai U A 2 U • • • . Then A n converges to the intersection 
A = PlnsN ^ n i n the Hausdorff metric. 

Proof. Let e > 0 be given. Now A C A n , so A C N e (A n ). For the other 
direction, note that the e neighborhood of A, 

N e (A) = { y : g(x, y) < £ for some x £ A } , 

is an open set. The family {iV e (H)} U { S \ A n : n £ N } is an open cover of 

Ai. Since A\ is compact, there is a finite subcover. This means that, for some 
N £ N, we have (S \ A n ) U N E (A) D A\ for all n > N . Therefore we have 

A n C N e (A). So D(A , An) < £ for all n> N. This shows that A n — > A. □ 
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Proposition 2.5.7. Suppose S is a compact metric space , T is a metric space 
and the sequence f n : S — > T converges uniformly to f. Then the image sets 
f n [S] converge to /[S'] according to the Hausdorff metric on H(T). 

Proof. This follows from the inequality Z?(/[S], /„[S]) < g u (f,f n )- □ 

Exercise 2.5.8. Let A. n be a sequence of nonempty compact subsets of S. If 
A is a cluster point of the sequence ( A n ), then A is contained in the set of all 
points x £ S for which there exists a sequence ( x n ) such that x n £ A n and x 
is a cluster point of ( x n ). 

Heighway’s Dragon Tiles the Plane 

Let us apply some of the properties of set convergence to study the Highway 
dragon fractal (p. 20). 

Proposition 2.5.9. The Heighway dragon is a space-filling curve. 


Proof. The information and notation used here are from the proof of Proposi- 
tion 1.5.7. Suppose, at some stage of the construction, the polygon P n contains 
all of the sides of a square S of the lattice L n together with all of the sides of 
the four adjoining squares, as in Fig. 2.5.9(a). Then the polygon P n+ 2 , shown 
in Fig. 2.5.9(b), contains all four sides of all of the squares contained in S, 
together with all of the sides of all their adjoining squares. 

Now the polygons P7 and Pg each contain such a square S. By induction, 
all sides of all of the subsquares of S are contained in the polygons Pfc, for 
k > 7. By Exercise 2.5.2, the limit P contains all points of the interior of the 
square S. □ 

We claim, in fact, that P tiles the plane. By this we mean: M 2 is the union 
of countably many sets, each congruent to P, and any two of them intersect 
at most in their boundaries. 

Start with the vertical lines x = k with integer k and horizontal lines y = j 
with integer j. They subdivide the plane into squares of side 1. Imagine these 


D 


Fig. 2.5.9. (a) 


(b) 
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squares classified in the checkerboard pattern as “black” and “white” so that 
squares that share an edge have opposite colors. For our tiling we begin with 
the edges of this square subdivision. Each edge is a line-segment of length 1. 
Starting with the edge, the process described for producing the dragon can be 
carried out. We need a “direction” for each edge. Let’s say the vertical edges 
have the black square on their left, and the horizontal edges have the black 
square on their right. Then each edge of this square lattice occurs once. When 
we carry out one step in the process, replacing each edge by two shorter ones, 
we will see (as in the proof of proof of Proposition 1.5.7) that the result will be 
a new square lattice in the plane, with segments l/\/2 times the length, but 
still every segment in the lattice occurs exactly once. In the limit, each edge 
from the original square lattice gives rise to one congruent copy of Heighway’s 
dragon. 

See a few stages in Plate 16. One particular edge is shown in black at 
the center, and a few of its neighboring edges are shown in in other colors. 
We know (Proposition 1.5.5) that the approximating polygons for any given 
dragon remain in a bounded region of the plane. So: given any point (x,y) 
in the plane, the squares surrounding it in the stages of the process can only 
come from a finite number of original edges. (In the plate, only a finite number 
of colors can be involved.) So there is at least one color for which a sequence in 
that color converges to the point (x, y). So (x, y) belongs to at least one of the 
dragons. It could happen that a point (x,y) belongs to two or more dragons: 
these will be boundary points for two or more dragons. Plate 17 shows nine 
dragons of the tiling that result from the nine edges of Plate 16. 


2.6 Metrics for Strings 

In Sect. 2.1, we defined a metric q±/ 2 for the space E^ of infinite strings from 
the two- letter alphabet {0, 1}. There are other, equally good, metrics for the 
same space. 

Metrics for 01-Strings 

Let r be a real number satisfying 0 < r < 1. A metric g r is defined on E^ 1 in 
the same way as the metric Q 1 / 2 ' if cr = aa 1 , r = qt', where the first character 
of o' is different than the first character of r', and if k = |a| is the length of 
a, then 

Q r (cr,T) = r k . 

Exercise 2.6.1. (1) g r is a metric on 

(2) The basic set [a] has diameter r^, for all a € E^*\ 

(3) The space (E( u \ g r ) is complete, compact, and separable. 

Proposition 2.6.2. The metric spaces constructed from E^ using the dif- 
ferent metrics g r are all homeomorphic to each other. 


76 


2 Metric Topology 


Proof. Let 0 < r, s < 1. If h: E^> — > is the identity function h(cr) = a, 

then we will show that h is a homeomorphism from {E^ u \ gf) to [E^, gf). 
It is enough to show that h is continuous, since applying this result with r 
and s interchanged will then show that ft, -1 is continuous. 

Given e > 0, choose k so that s k < e, then choose 6 = r k . So if cr, r £ 
E^ with g r (a,r) < 5, then g r (o,r) < r k , so er and r have at least their 
first k letters in common. But then g s (o,r) < s k < e. This shows that h is 
continuous. (In fact, uniformly continuous.) □ 


Since all of these metrics are homeomorphic, changing from one metric to 
another will make no difference for topological properties, such as compact- 
ness or separability, or such as the topological dimension (Chap. 3). So we will 
normally use the metric g\i 2 when dealing with topological properties. But 
changing from one metric to another does make a difference for metric proper- 
ties, such as fractal dimension (Chap. 6). Here is a simple example where only 
one of the metrics g r is appropriate: the addressing function for the Cantor 
dust (see p. 14) is a lipeomorphism. 

Proposition 2.6.3. Let h: E^ — > R. be the addressing function for the Can- 
tor dust, satisfying 


h(0o) 

h(lo) 


MfO 

3 

h(o) + 2 
3 


Then we have 

^£>i/3(ct,t) < | h{a) - h(r)\ < £>i/ 3 (cr,T). 
Proof. Let er, r be given. Let a be their longest common prefix: 


er = aa' , 
t = or 7 , 


where the first letters of o' and t ' are different. I will prove the inequalities 
by induction on the length |a|. If |a| = 0, that is a is the empty string A, 
then £>i /3 (er, r ) = 1; the two real numbers h(o),h(r) both lie in [0,1], so 
|/i(cr) — h(r)\ < 1; and er, r begin with different letters, so one of h{o),h{r) is 
in [0,1/3] and the other is in [2/3,1], and thus \h(o) — h(r) | > 1/3. So the 
result is true when the length |a| = 0. 

Suppose the result is known when |a| = fc, and consider the case |a| = fc+1. 
Then either a = 0/3 or a = 1(3 for some (3 with |/3| = k. We will take the case 
a = 1/3; the other case is similar. By the induction hypothesis, applied to f3o' 
and /3r 7 , 

^£ , i/ 3 (/3cr 7 ,/3r') < |/i(/3cr') -/i(/3r')| < £> 1/3 (/3cr 7 ,/3T 7 ). 
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But 


and 


01/3 r ) = 


\h(a) — h(r) | = | /i(ch<t / ) — /i(ar')| 

= | /i(l/3cr / ) — /i(l/3r , )| 

/i(/3a') + 2 h(Pr') + 2 
3 3 

= ^ - h{pr')\. 

So the inequalities for this case follow. □ 

Exercise 2.6.4. If r ^ 1/3, then the addressing function h is not a lipeomor- 
phism from {E^ u \ g r ) to M. 

The range /i[{0, of the function h in Proposition 2.6.3 is, of course, 

exactly the triadic Cantor dust C. The proposition shows that the space 
{0, of infinite strings is homeomorphic to the Cantor dust C. 

There are other (useful) ways to define a metric on the space E^\ Now 
let us assign a positive real number w a to each node a of the binary tree . 
If they satisfy the right conditions, then we will be able to define a metric g 
on so that the basic set [a] will have diameter exactly w a . 

Proposition 2.6.5. Let a family w a of positive real numbers be given, one 
for each node a in the infinite binary tree E W . Suppose: 


w a > W /3 if a < /3 
lim w a t n = 0 for a € E^ . 

n— >oo 

Then there is an ultrametric g on E^ u > such that diam[a] = w a for all a. 

Proof. Define g as follows. If a = r, then g(a, r) = 0. If a ^ r, then g(a , r) = 
w a , where a is the longest common prefix of rr and r. 

Clearly g(a,r) > 0. If a ^ r, then g(a,r) = Wk > 0. The equation 
g(a , r) = g(r, a) is also easy. 

So I must verify the ultra-triangle inequality, 

g(a,r) < ma x{g(a,9), g(e,r)}. 

If two (or three) of the strings a, 9, r are equal, then this is trivial. So suppose 
they are all different. Let a be the longest common prefix of a and 9 ; let fi 
be the longest common prefix of 9 and r; let 7 be the longest common prefix 
of cr and t. Now a and (3 are both prefixes of 9 , so one of them is a prefix 
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of the other. If a < (3, then a is a prefix of both a and r, so a < 7, and 
therefore 


= Wry < w a = g(cr,9) < ma x{g(a,9), g(9,r)}. 

The case (3 < a is similar. 

Now I must show diam[a] = w a . If er, r £ [a], then the longest common 
prefix (3 of a and r is > a, so g(a,r) = wp < w a . Therefore diam[a] < w a . 
Choose any er £ E^; then aOer, ala £ [a] and g(aOa,ala) = w a . Therefore 
diam[a] > w a . □ 

Exercise 2.6.6. Let g be the metric defined on E ^ using diameters w a , 
where 


w a > w /3 if a < f3 
lim Wa t n = 0 for er £ E^\ 

n — too 


Then 

(1) (E^,g) is homeomorphic to (E^, gi/2) ■ 

(2) The countable set { [a] : a £ } of “basic open sets” is equal to the set 

{ B r (a) : er £ E^ u \r > 0 } of all open balls, and to the set of all closed 
balls. 

Proposition 2.6.7. Let A be any subset of E^ with at least two elements. 
Then there is a basic open set [a] such that A C [a] and diamTi = diam([a]). 

Proof. Let a be the longest common prefix for the set A (as in Proposi- 
tion 1.3.2). The string a is finite (possibly empty) since A has at least two 
elements. Clearly AC [a], and therefore diamTl < diam[o]. Now choose 
er £ A. If k is the length |a|, then of course a\k = a. Since er|"(fc + 1) is not 
a common prefix for A, there is r £ A with r\(k + 1) ^ erf (A; + 1). That 
means that a is the longest common prefix for the pair {er, r}, and therefore 
g(u 1 t ) = w a . Therefore diamTl > w a = diarri [a] . □ 


Other String Spaces 

It is probably clear to the reader by now that what has been done for strings 
from the alphabet {0, 1} depends very little on that particular choice. Most 
of it could equally well be done for any (finite) alphabet. 

Let E be any finite set; we assume that E has at least two elements. We 
call the elements of E “letters” and we call E an “alphabet” 

Exercise 2.6.8. Formulate definitions of the following: string from the alpha- 
bet E\ E length |a| of a string a; E^; E^\ [ a] . Formulate variants of the 
following results for this setting: 1.3.2, 2.1.8, 2.6.1, 2.6.2, 2.6.5, 2.6.6, 2.6.7. 
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The “continuous” function is useful, but perhaps unfamiliar, on these 
string spaces. Roughly speaking, continuity of a function / means that any 
finite amount of information about /(er) is determined by only a finite amount 
of information about a. Here is a more precise statement: 

Exercise 2.6.9. Let E\ and E 2 be two finite sets. Let a £ E^ 1 and let 
f: E — ► E^ 1 be a function. Then the following are equivalent: 

(1) / is continuous at er; 

(2) for every integer n there exists m so that for r £ e[^\ if we have rfm = 
a I'm then we have /(r) |"n = /(er) |"n. 

Path Spaces of Graphs 

In Chaps. 4 and 7 we will require a further generalization of the string spaces 

£(“). 

Figure 2.6.10 shows examples of certain “graphs” that we will be using. 
There is a finite set V of vertices or nodes , and there are edges. Each 
edge goes from one vertex to another (or possibly back to the same one). 
The direction is important, so this is a directed graph. There may be more 
than one edge connecting a given pair of nodes, so this is a multigraph. This 
informal description (together with the pictures) is probably enough to tell 
you what a directed multigraph is. But we will need a more mathematically 
sound definition. 

A directed multigraph consists of two (finite) sets V and E, and two 
functions i: E V and t: E — > V. The elements of V are called vertices or 
nodes ; the elements of E are called edges or arrows. For an edge e, we call 
i[e) the initial vertex of e, and we call t(e) the terminal vertex of e. We 
will often write E uv for the set of all edges e with i(e) = u and t(e) = v. 

There is an example pictured in Fig, 2.6.11. In this case, V = {S,T} and 
E = {a, b, c, d, e, f}. We have, for example, i(c) = S, t(c) = T, i(f) = i(f) = T. 

A path in a directed multigraph is a sequence of edges, taken in some 
order, so that the terminal vertex of one edge is the initial vertex of the next 




Fig. 2.6.10. Directed Multigraphs 
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b 


c 


d 


e 


Fig. 2.6.11. An Example 



edge. A path will often be identified with a string made up of the labels of the 
edges. Here are some examples of paths in the example: c or bedfe or aaaa. The 
initial vertex of a path is, by definition, the initial vertex of the first edge 
in the path. Similarly, the terminal vertex of a path is the terminal vertex 
of the last edge in the path. We extend the functions i and t accordingly. For 
example, if a = bedfe, then i(a ) = S and t(a ) = S. 

^ ^ / .j, \ •••• • 

We will write Euv for the set of all paths with initial vertex u and terminal 
vertex v. We may say that such a path goes from u to v , or that it connects 
u to v. The number of edges in a path is its length, written |a|. So in the 
example a = bedfe, we have |a| = 5. We will write E ^ for the set of all paths 
from u to v of length n; and Eu for the set of all paths of length n with 
initial vertex u; and E ^ for the set of all paths of length n. The empty set 
conventions will work out best if we say (by convention) that for each u £ V, 
the set Euu has one element, the empty path from u to itself, written A u . 
Of course, we may identify E with E^ and E uv with Euv ■ Note that the 
conventions have been set so that if the strings a, (3 represent paths, and the 
terminal vertex of a is equal to the initial vertex of (3 , then the concatenated 
string ot(3 represents a path, as well. 

The case that we have considered before, where all strings from a given 
alphabet E are allowed, corresponds to a graph with only one vertex, and the 
alphabet E as edge set. 

A path a with i(a) = t(a) is called a cycle. A cycle of length 1 is a loop. A 
directed multigraph is strongly connected iff, for each pair u, v of vertices, 
there is a path from u to v. 

Let ( V,E,i,t ) be a directed multigraph. We will consider the set E < *' ) 
of all paths in the graph. This naturally has the structure of a “tree”: If a 
is a path, then the children of a are the paths ae, for edges e with i(e) = 
t(a). Actually this is not a tree: it is a finite disjoint union of trees, one tree 
Ev ; corresponding to each node v of the graph. A disjoint union of trees is 
sometimes called a forest. So we will call this the path forest of the graph. 

What about infinite paths, corresponding to the infinite strings that we 
have considered before? An infinite string a corresponds to an infinite path 
if the terminal vertex for each edge matches the initial vertex for the next 
edge. We write E ^ for the set of all infinite paths for the graph ( V , E, i, t). If 
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v £ V is a vertex, then we write E^ for the set of all infinite paths starting 
at v. (It usually does not make sense to assign a terminal vertex to an infinite 
path.) There will be one of these path spaces for each vertex of the 

graph ( V,E,i,t ), or one for each tree e[*^ in the forest E W of finite paths. 
If a € E(*\ then we write [a] — {cr £ E^ : a < a }, that is, the set of all 
infinite paths that begin with the finite path a. 

Metrics may be defined on the path spaces E^ in much the same way as 
has already been done. Some notice must be taken of the possibility that some 
nodes in E^ have no children, or only one child. Now if a has no children, 
then [a] = 0, so its diameter must be 0. If a has only one child /3, then 
[cr] = [/?], so diam[a] must be equal to diam[/3]. These cases will not occur if, 
in the graph ( V,E,i,t ), each node has at least two edges leaving it. 

Let (V,E,i,t) be a directed multigraph. Let a family w a of positive real 
numbers be given, one for each node a in the path forest E^*\ We want to 
define distances g(a, t). In fact, only distances between infinite strings with 
the same initial vertex will be needed. So we can think of defining several 
(disjoint) metric spaces E^\ one for each node v £ V. We will normally 
write simply g for all the metrics involved. 

Suppose the family w a satisfies 

w a > wp if a < (3 
lim w a \ n = 0 for a £ E^\ 

n — >oo 

The definition for the metrics is as before. If a, r £ E^\ and a ^ r, then 
the two strings have a longest common prefix, since they have at least the 
prefix A v in common. So define g(a,r) = w a , where a is the longest common 
prefix of a and r. This defines ultrametrics g on the spaces E i?' 1 such that 
diam[a] = w a for “most” a. For which a might the equation fail? 

Exercise 2.6.12. Adapt the following results to this setting: 2.6.6, 2.6.7 


2.7 *Remarks 

It is probably more conventional in fractal geometry to limit consideration to 
subsets of Euclidean space. But metric spaces are the proper setting for the 
theory of topological dimension, and perhaps even for the theory of Hausdorff 
dimension. We will need uniform convergence of functions, and the Hausdorff 
metric for sets; even if the primary interest is Euclidean space, these lead 
naturally to other metric spaces. 

On the other hand, I have limited the discussion to metric spaces, rather 
than more general topological spaces. Certainly Hausdorff dimension belongs 
in a metric space. Topological dimension can be done in much greater general- 
ity, but I will leave that to the student who wants to go beyond this book. In 
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fact, whenever it is convenient, the discussion will be restricted to separable 
metric spaces, or even to compact metric spaces. 

Additional material along the lines of this chapter can be found in texts 
called “general topology” or “point-set topology” such as [38] or [41] . 

According to the definition in this chapter, d-dimensional Euclidean space 
is the set R d of d-tuples of real numbers. For our purposes, this is reasonable. 
But in some other contexts, one would say that R d is really more properly 
considered to be d-dimensional Euclidean space together with a distinguished 
coordinate system. This sort of distinction emphasizes the difference between 
two camps called “synthetic geometry” and “analytic geometry” 

The reader aware of such things may have noticed that the axiom of choice 
is freely used in this chapter. There is no better way. If we want to do metric 
topology using sequences, then at least countable choice must be used. 

The spaces E^ u > that have been called here “string models” or “path mod- 
els” are more commonly known as “shift models” and “subshift models’.’ How- 
ever, the word “shift” involved refers to the left shift on these spaces, which 
is never* mentioned in this book, so its use seemed inappropriate. 

For the Hausdorff metric on sets in a metric space see for example [33, §28]. 
Felix Hausdorff ’s mathematical writings are few in number, but immensely 
influential. 


Comments on Exercises 

Do not read this until you have tried to do the exercise yourself! 

Exercise 2.1.2: x<y<z or x>y>z. That is, y is between x and z. 

Exercise 2.1.7: \x + y\ = |rc| + \y\ if and only if one of x, y is a nonnegative 
scalar multiple of the other. 

Exercise 2.1.14: An “if and only if” proof is done in two parts, an “if” part 
and an “only if” part. Two sets are equal when each is a subset of the other. 

Exercise 2.2.10: This can be done from the e, 6 definition (good practice 
using the definition). Or it can be done using Theorem 2.2.9. 

Exercise 2.2.15: Given a Cauchy sequence in R 3 , use the completeness of 
R three times; successively extract subsequences so that each of the three 
coordinates converges. Then show that the resulting subsequence converges. 

Exercise 2.2.30: (1) U = {x : dist(a;, A) < dist (x,B) }. (2) 

U = { x £ S : dist(a:, A) < (1/2) dist(a;, B) } 

V = { x £ S : dist(a:, A) > 2 dist(cc, B) } . 

Exercise 2.2.18: The sets R and (0,1) are homeomorphic. 

Exercise 2.3.12: Show that is countably compact. If A is infinite, then 
at least one of the sets in [0], An [1] is infinite. Then at least one of the 
subsets A fl [a], a £ E^ 2 \ is infinite. Etc. 


With this exception 
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Exercise 2.3.18: In the plane, consider the two graphs xy = 1 and xy = —1. 
Exercise 2.5.2: Suppose A n — > A, and let 

B = { x : there is a sequence (x n ) with x n £ A n and x n — > x } . 

First, A C B: Since any element x £ A is within distance D(A, A n ) + 1/n of 
some element x n of A ni we have x n — > x. For B C A: Since any element of B 
is within distance e of elements of A n for large enough n, and those elements, 
in turn, are within distance e of elements of A. But A is closed, so any point 
with distance 0 from A belongs to A. 


I have, with difficulty, been prevented 
by my friends from labeling [the book]: 
What Every Young Analyst Should Know. 
— J. L. Kelley, General Topology 



3 


Topological Dimension 


The sets of elementary geometry have associated with them a dimension. 
Points have dimension 0. Curves have dimension 1. Surfaces have dimension 
2. Solids have dimension 3. When we leave elementary geometry, there is 
the possibility of considering point sets not falling into any of these clear-cut 
groups. Mathematicians have defined a general notion of dimension to help 
out in the study of such sets. In fact, they have done it in several different 
ways; one way of defining dimension may be useful for one purpose but not 
for another purpose. 

We will discuss more than one of these definitions. The definitions that 
will be considered fall generally into two broad classes: topological dimension 
and fractal dimension. The first one is a topological dimension known as the 
“covering dimension’.’ Keep in mind that these are topological dimensions 
also in a more technical sense: if two spaces S and T are homeomorphic, then 
their dimensions are equal. 


3.1 Zero-Dimensional Spaces 

Figure 3.1.1 suggests that if a set should be considered “ 1-dimensional” then 
it can be covered by small open sets that intersect only 2 at a time. (That is: 
any 3 of the sets have empty intersection; or, each point belongs to at most 2 
of the sets.) A set is considered “0-dimensional” if it can be covered by small 
open sets that are disjoint. A set is “2-dimensional” if it can be covered by 
small open sets that intersect only 3 at a time. The usual “brick” packing 
shown in Fig. 3.1.2 covers the plane by closed rectangular sets that intersect 
at most 3 at a time — if the rectangles are enlarged slightly to open sets, we 
can still arrange that they intersect at most 3 at a time. 

This idea is reasonable for compact metric spaces (see Theorem 3.2.2), but 
needs a bit of fine-tuning for non-compact spaces. 

In this section we will discuss zero-dimensional spaces. In Sect. 3.2 we will 
go on to higher dimensions. 
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Fig. 3.1.2. A cover of the plane 


Definition 

If A and 53 are two collections of sets, we say that 13 is subordinate to A iff 
every B £ 13 there is A £ A with B C A. Let S' be a metric space, and let 
A be an open cover of S. A refinement of A is an open cover 53 of S that is 
subordinate to A. We also say say 53 refines A. For example, a subcover of 
A is a refinement of A. 

Exercise 3.1.3. Let S be a metric space. Then S is compact if and only if 
every open cover has a finite refinement. 

Let S be a metric space. A set A in a metric space S will be called clopen 
iff it is both a closed set and an open set. In particular, 0 and S are clopen. 
If A, B are clopen, then so are An B, AUB, and A\B. (The collection of all 
clopen subsets of S is an algebra of sets in the sense to be defined on p. 147.) 

A clopen partition of S is an open cover of S consisting of disjoint clopen 
sets. The space S is zero- dimensional iff every finite open cover of S has a 
finite refinement that is a clopen partition. 

Example: If S is finite, then S is zero-dimensional. To see this: Each single- 
ton {a;} is an open set, therefore a clopen set. And any open cover is refined by 
the open cover made up of all these singletons, which is a clopen partition of S. 

Another good example is the Cantor dust C ; see p. 2 for the definition 
and notations C, C n . Let A be an open cover of C. For each x £ C, there is 
an open set A £ A with a ; £ A, and therefore a positive r with B r (x) flC C A. 
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But then there is an integer n with 3 - " < r, so the interval I of C n that 
contains x has length 3 -n and / (~l C C A. The complement in C of I is a 
finite union of closed intervals, so I fl C is clopen in C. For each x £ C choose 
such an interval I x with I x nC clopen in C and I x C A for some A £ A. Thus 

A\ = { I x : x £ C } 

is an open cover of C. Since C is compact, there is a finite subcover of A\, say 
A 2 = {Ji, I 2 , ■ ■ • , Ik}- Now if we write Ji = Ji, J 2 = h\Ji, J 3 = h\ -h, and 
so on, we get a finite cover A 3 = { Ji, J 2 , • • ■ ,Jk} of C by sets clopen in C, 
which is still a refinement of A. The sets Ji,J 2 ,--- are disjoint, so the cover 
A 3 is a clopen partition of C. 

The next example to consider is the line R. It is not zero-dimensional. To 
prove this, I will need to show that there are not very many clopen sets in R. 

Theorem 3.1.4. The only clopen sets in the space R are 0 andR. Therefore, 
R is not zero- dimensional. 

Proof. Let A C R, and suppose 1/0 and A ^ R. I must show that A is 
not a clopen set, or, equivalently, that A has a boundary point. We will define 
recursively two sequences, (x n ) and (y n )- First, we may choose a point xq £ A 
since A ^ 0 . Also, we may choose a point yo qL A since 1 / R. After x n 
and y n have been defined, with x n £ A and y n £ A, we want to define x„+i 
and y„ + i. Consider the midpoint z n = (x n + y n )/ 2. If z n £ A, then define 
Xn +1 = z n , 2/n+i = Vn, and if z n £ A, then define x n+1 = x„, y n + 1 = z n . So 
in any case, we get x n+ i £ A and y n+1 g A , with |x n+ i — y n +i \ = \x n -y n |/2. 
So by induction \x n — y n \ = |xo — yo\/2 n . Thus \x n — y n \ — >■ 0 as n — > 00 . 
Also, |x n+ i - x n | < \x n - y n | = |x 0 - 3 / 0 1/2", so (x n ) is a Cauchy sequence. 
Let x = lim„ x n . Because \x n — y n \ — > 0, we have also y n — > x. Therefore x is 
a boundary point of A. So A is not a clopen set. 

Now the two-element cover (— 00 , 1) U (—1, 00 ) of R has no clopen refine- 
ment. So R is not zero-dimensional. □ 

It is worth pointing out the following, which has the same proof. 

Corollary 3.1.5. Let a <b. The only clopen subsets of the space [ a,b } are 0 
and[a,b\. Therefore, [ a,b } is not zero- dimensional. 

Here is a result promised in Chap. 2, p. 70: 

Proposition 3.1.6. Let C be the Cantor dust. There is no continuous func- 
tion from the interval [ 0 , 1 ] onto C . 

Proof. Suppose h: [0, 1] — > C is surjective. The set M = [0, 1/3] flC is a clopen 
subset of C. Both M and its complement C\M are nonempty. So both the 
set h~ 1 [M ] and its complement h~ 1 [C\M] are nonempty. By Corollary 3.1.5, 
not both sets are open in [0, 1]. By Theorem 2.2.6, h is not continuous. □ 
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Properties for Zero-Dimensionality 

Here are some variants of the definition. 

Theorem 3.1.7. Let S be a metric space. The following are equivalent: 

(1) S is zero- dimensional. 

(2) If {Ui,U 2 , - • • ,Uk} is any finite open cover of S, then there exist sets 
B i C U\, f ?2 C {/ 2 , ■ ■ ■ , Bk C Uk, such that {B\,B 2 , ■ • • , Bk} is a clopen 
partition of S. 

(3) If {U,V} is an open cover of S, then there exist open sets A C U, and 
B C V such that A U B = S and A n B = 0 . 

(4) If {U, V} is an open cover of S, then there exist closed sets ACU, and 
B C V such that A LI B = S and A D B = 0. 

Proof. (2) => (3) and (2) => (1) are clear. (3) -4=> (4) follows because in 
both cases the sets A, B are clopen. 

(1) ==> (2). Suppose S is zero-dimensional. The open cover {iTi, C/ 2 , - - - , Uk} 
is refined by a clopen partition W. For each W £ W there is at least one i 
such that W C Up, choose one of them, and call it i(W). Now for each i, let 

Bi = \J{W £ W : i(W) = i}. 

Then the sets B t are open, and (J - Bi = W = S. The complement of 

Bj is 

[j{W€ W: 

which is open. So Bj is clopen. If x £ S, then (since W is a partition) x 
belongs to at most one of the sets W. But x £ Bi only if x £ W for some W 
with i(W) = i. So it belongs to at most one of the sets Bi. That is, the sets 
Bi are disjoint. 

(3) (2). Suppose S has the property (3). Let {U\, U 2 , • • • , Uk} be an 

open cover of S. If k = 1, then this cover is already a clopen partition. So 
suppose k > 2. Now write U = U\, V = (J*_ 2 Bi- Then these sets cover 5, so 
by the hypothesis (3), there exist clopen sets A C U, B C V with A U B = S 
and An B = 0. Let Bi = A and Bi = B n Ui for i > 2. Then Bi C Ui for all 
i, we have UiLi Bi = S, and Bi (~l B 2 = 0. Repeat this construction a finite 
number of times, once for each two-element subset of {1, 2, • • • , k}, to arrange 
the same conclusion with all of the intersections of two of the sets empty. □ 


Separation 

Recall Exercise 2.2.30(2): If A and B are disjoint closed sets, then there open 
sets U and V with U A A, V 0 B, and U D V = 0. Zero-dimensional spaces 
have a stronger separation property. 
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Proposition 3.1.8. Let S be a metric space. Then the following are equiva- 
lent: 

(1) S is zero- dimensional. 

(2) For every pair A, B of disjoint closed sets, there is a clopen set U with 
complement V = S\U so that U D A and V D B . 



Fig. 3.1.8. Illustration for Theorem 3.1.8 


Proof. Suppose S is zero-dimensional. Let A, B be disjoint closed sets. Then 
their complements A! = S \ A and B' = S \ B are open. Since A (~l B = 0, 
the complements cover: A' U B' = S. So by Theorem 3. 1.7(3), there are open 
sets U, V with U C B\ V C A', U U V = S, U ft V = 0. So U, V are clopen, 
U D A and V D B. 

Conversely, suppose (2) holds. Let {Ui, U 2 } be an open cover of S. Then 
the complements A = S \Ui, B = S \U 2 are disjoint closed sets. Therefore 
there exists clopen set V with complement U = S\ V so that V A A, U A B. 
So V C U 2 and U C JJ\, and by Theorem 3. 1.7(3), S is zero-dimensional. □ 


Base for the Topology 

Dimension zero is related to the existence of a clopen base for the topology. 

Proposition 3.1.9. Let S be a zero- dimensional metric space. Then there is 
a base for the topology of S consisting of clopen sets. 

Proof. Let U C S be an open set and let xq £ U. The distance r := 
dist^ch-S \ U) > 0. So S is covered by the two open sets U and V = 
{ x £ S : g(x, Xq) > r/2 }. This cover is refined by a clopen partition A. This 
shows there is a base for the topology of S consisting of clopen sets. □ 

The converse in the compact case can be proved using the argument from 
the Cantor dust, above. 
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Proposition 3.1.10. Let S be a nonempty compact metric space. Assume 
there is a base for the topology of S consisting of clopen sets. Then S is zero- 
dimensional. 

Proof. Let B be a base for the topology of S consisting of clopen sets. Let A 
be an open cover of S. For each x £ S, there is an open set A a A with x £ A, 
and therefore there is some B £ B with x £ B C A. For each x £ S choose 
such B x £ B. Then 

Ai = { B x : x G S } 

is an open cover of S. By compactness, there is a finite subcover, say A 2 = 
{Bi,B 2 , • • • , B k }. Now if we write Ji = B\, J 2 = B 2 \ Ji, J 3 = B 3 \ J 2 , and 
so on, we get a finite cover A 3 = { Ji, J 2 , • • ■ , J k } of C by clopen sets, which 
is still subordinate to A. The sets Ji, J 2 , • • • are disjoint, so .A 3 is a clopen 
partition of S'. □ 

In fact, the converse holds more generally for separable metric space. 

Proposition 3.1.11. Let S be a separable metric space. Then S is zero- 
dimensional if and only if there is a base for the topology of S consisting 
of clopen sets. 

Proof. One direction was done in Proposition 3.1.9. For the converse, assume 
there is a base B for the topology of S consisting of clopen sets. Let {U,V} 
be an open cover of S. There is a collection Hi C B with (J Hi = 17. By the 
Lindelof property (Theorem 2.3.1), there is a countable subcollection lf 2 C Hi 
with (J U 2 = U. Similarly there is a countable collection V 2 C B with |J V 2 = 
V. Enumerate the union: 

H 2 UV 2 = {Gi,G 2 ,G 3 ,---}. 

The sets G m are clopen, and (J G m = S. Define Hi = Gi, H 2 = G 2 \ Gi, and 
in general H m = G m \ (Gi U • • • U G m _i). So the sets H rn are clopen, disjoint, 
and U H m = S. Now let 

E=\J{H m : H m CU}, F = H m %U}. 

Then U, V are open, U C\V = 0 , U UV = S, so U,V are clopen. Also ECU 
and F CV. This shows that S is zero-dimensional. □ 

Exercise 3.1.12. The set Q of rational numbers (with the usual metric) is 
zero-dimensional. The complementary set K \ Q of irrationals is also zero- 
dimensional. 

Exercise 3.1.13. Let AT be a nonempty compact set in R 2 . Let / 1 , f 2 , ■ ■ ■ , f n 
be contracting similarities of R 2 to itself. Suppose K satisfies the self- 
referential equation 

n 

K=\J.f i [K\, 

i= 1 

and that fi[K\ PI fj [K] = 0 for i 7 ^ j. Show K is zero-dimensional. 
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Sum Theorem 

The real line is not zero-dimensional, but according to Exercise 3.1.12 it can 
be written as a union of two zero-dimensional sets. At least the union behaves 
well for closed sets. 

Theorem 3.1.14. Let S be a metric space. Let F\ and F 2 be closed sets in 
S. If F\ and F 2 are both zero-dimensional, then F\ U E 2 is zero-dimensional. 



Fig. 3.1.14. Illustration for Theorem 3.1.14 


Proof. Let F\ and E 2 be zero-dimensional and closed. (Remark: Since F\ is 
closed, a subset E C F\ is closed in F\ if and only if E is closed in S. The 
same may not be true for “open” : a set E C F\ open in F\ need not be open in 
S.) We must show that F\ U E 2 is also zero-dimensional. Let A , B be disjoint 
closed sets in F\ U F 2 . Now AC\F\ and B n F\ are closed sets, so they are 
closed in F\. They are disjoint. So, since Ej is zero-dimensional, there exists 
a set K clopen in Ej with complement L = F\\K such that K D A (T Ei and 
L D B fl Ei. Now K and L are closed in S. The two sets ( K U A) (~l F 2 and 
(lUB)n E 2 are disjoint and closed in F 2 . So, since E 2 is zero-dimensional, 
there exists a set P clopen in E 2 with complement Q = F 2 \ P such that 
P D (K U A) n E 2 and Q D (E U B) n E 2 . Now let U = K U P, V = L U Q. So 
U and V are closed, disjoint, and U U V = F\ U E 2 . So U and V are clopen 
in El U F 2 . And U D A, V D B. So by Proposition 3.1.8, Ei U E 2 is zero- 
dimensional. □ 


3.2 Covering Dimension 

Let n > — 1 be an integer. The order of a family A of sets is < n iff any n + 2 
of the sets have empty intersection. If n > 0, then we say A has order n iff it 
has order < n but does not have order < n — 1. For example, a family A of 
nonempty sets is disjoint if and only if its order is 0. A family of sets A has 
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order —1 iff it is empty or is the singleton {0}. The set of disks in Fig. 3.1.1 
has order 1. The set of rectangles in Fig. 3.1.2 has order 2. 

The “covering dimension” will be defined now. The covering dimension 
of a metric space will be either an integer > -1 or the extra symbol oo, 
understood to be larger than all integers. 

Let S' be a metric space. Let n > — 1 be an integer. We say that S has 
covering dimension < n iff every finite open cover of S has a refinement 
with order < n. The covering dimension is n iff the covering dimension is < n 
but not < n—l. We will write Cov S = n in that case. If the covering dimension 
is < n for no integer n, then we say CovS = oo. Covering dimension is also 
known as Lebesgue dimension. 

Let us consider the simplest cases. If CovS = —1, then the open cover 
{S} is refined by a cover of order —1, which is either 0 or {0}; this is a cover 
only if S = 0. So Cov S = — 1 if and only if S = 0. 

For nonempty S, the covering dimension is 0 if and only if S is zero- 
dimensional in the sense defined in Sect. 3.1. 

Basics of Covering Dimension 

Here are some simple variants of the definition of covering dimension. 

Theorem 3.2.1. Let S be a metric space, and let n be a nonnegative integer. 
The following are equivalent: 

(1) Cov S < n 

(2) If {U\, U 2 , • • • , Uk} is any finite open cover of S, then there exist open sets 
Bi C U\, B 2 C U 2 , ■ ■ ■ , Bk C Uk, such that {£?i, B 2 , • • • , Bk} is an open 
cover of S with order < n. 

(3) If {U\, U 2 , ■ ■ ■ ,U n + 2 } is an open cover of S, then there exist open sets 

B\ C Ui, B 2 C U 2 , ‘ , B n+ 2 C U n+ 2 , such that US. 2 = B an ^ 

ns 2 b> = 0. 

(4) If {Ui, U 2 , ■ ■ ■ ,U n+ 2 } is an open cover of S, then there exist closed sets 

Fi C Ui, F 2 C F n+ 2 C U n+2 , such that (J”=i 2 F i = S and 

nS 2 p t = 0. 

Proof. (2) ==^ (3) and (2) =» (1) are clear. 

(1) =» (2). Suppose Cov S < n. The open cover {U\, U 2 , ■■■ , Uk} admits 
a finite refinement W of order < n. For each W € W there is at least one i 
such that W C Up, choose one of them, and call it i(W). Now for each i, let 

Bi = \J{W G W : i(W) = i}. 

Then the sets Bi are open, and (J,- Bi = Uwew W = S.li x £ S, then (since 
W has order < n) it belongs to at most n + 1 of the sets W . But x £ Bi only 
if x £ W for some W with i{W) = i. So x belongs to at most n + 1 of the 
sets Bi. 
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(3) => (2). Suppose S has the property (3). Let {U\, [/ 2 , ••• , Uk} be an 
open cover of S. If k < n + 1, then this cover itself already has order < n. 
So suppose k > n + 2. Now write W\ = U\, W 2 = U 2 , ■■■ , W n+ 1 = U n +\ and 
W n + 2 = Ui=n +2 Bi- Then these sets cover S, so by the hypothesis (3), there 
exist open sets V t C Wj with f Vi = S and f Vi = 0. Let B t = V for 

i < n + 1 and Bi = V n +2 fl Ui for i > n + 2. Then Bi C Ui for all i, we have 

U i= i Bi = S , and [^=1 Bi = 0- Repeat this construction a finite number of 
times, once for each subset of {1, 2, • • • , k} with n + 2 elements, to arrange 
the same conclusion with all of the intersections of size n + 2 empty. 

(3) => (4). There exist open sets Bi C Ui with [JBi = S and f]Bi = 0. 
Now S \ B\ C US 2 so by Corollary 2.2.29, there is an open set Vi with 
S\B 1 CVx C V[ C (J"+ 2 2 Bi. Let T 1 ! = 5 \ Vi. So we have Fi Q_B ± and 
F U U "=2 Bi = S. Next, there is an open set V 2 with S \ B 2 C V 2 C V 2 C (S \ 
FOCUS 2 Bi . Let F 2 = S\V 2 - So we have F 2 C B 2 and FiUi^UlJ"^ 2 Bi = S. 
Continue in this way. 

(4) => (3). There exist closed sets Fi as in (4). Now the closed set Fi is 

a subset of the open set Uid (S\ Bi), so there is an open set B\ with 

F 2 C5iC^C C/in(F\n”= 2 2 ^)- SolhQ Ux and R7nnr=2 2 T,; = 0. Next, 
there is an open set B 2 with F 2 C B 2 C B 2 C C/ 2 fl (5 \ (Bi D p|"J " 3 2 F,)) , so 

B 2 C u 2 and B\ D B 2 C Fj = 0. Continue in this way. □ 

Covering dimension for compact metric spaces may be defined in a simpler 
way. If A is a cover, then its mesh is sup^g^ diam A. 

Theorem 3.2.2. Let S be a compact metric space, and let n > —1 be an 
integer. Then Cov S < n if and only if for every e > 0, there is an open cover 
of S with order < n and mesh < e. 

Proof. Suppose Cov S < n. Let e > 0. The collection ll of all open sets of 
diameter < £ is a cover of S. So it has a refinement 'B of order at most n. But 
B has mesh < e. 

Conversely, suppose that for every £ > 0, there is an open cover of S 
with order < n and mesh < e. Let U be any finite open cover of S. By 
Theorem 2.3.22, the Lebesgue number r of 11 is positive. Let B be an open 
cover of S with order at most n and mesh less than the minimum of r and e. 
Then by the defining property of the Lebesgue number, B is a refinement of 
U. So Cov S < n. □ 

Separation 

The separation theorem for dimension 1 looks like this: 

Proposition 3.2.3. Let S be a metric space with Cov S < 1. Let A,B,C be 
closed subsets of S with A D B n C = 0 . Then there exist open sets U, V , W 
such that U A A, V A B, W A C, UUVUW = S andU nVCW = 0. 
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Proof. Sets A, B,C are closed, so the complements A' = S\ A, B' — S\B, 
C' = S \C are open. These complements cover S. So by Theorem 3. 2. 1(4), 
there exist closed sets F, G, H with F C A! , G C B ', H C C', FUGU H = S, 
and FnGHH = 0. Define U = S\F, V = S\G, W = S\H. Then U,V,W 
are open, U A A, V A B, W A G, U U V U W = S, and U (T V D W = 0. □ 

Using Theorem 3.2. 1(3) in place of Theorem 3. 2. 1(4) we obtain a variant: 

Proposition 3.2.4. Let S be a metric space with Cav S < 1. Let A,B,C be 
closed subsets of S with A (~l B (~l C = 0. Then there exist closed sets K, L, M 
such that KA A, LAB, MAC, KULUM = S and K D L D M = 0 . 

Exercise 3.2.5. Prove converses for Propositions 3.2.3 and 3.2.4. 

Exercise 3.2.6. For general n, formulate the separation theorem correspond- 
ing to covering dimension n. 

Here is the three-set separation for arbitrary metric space. Of course we 
cannot arrange that U\ U U 2 U U 3 = S in general. 

Proposition 3.2.7. Let S be a metric space. Let Ai,A 2 ,A^ C S be closed 

sets. Assume A\ fl A 2 0 = 0. Then there exist open sets U\, U 2 , U 3 C S 

with Ui A A\, U 2 A A 2 , t/3 A H3, and U \ O U 2 O t/3 = 0. 

Proof. For any x £ S, define di(x) = dist(x, Ai), d 2 ^x) = dist(a;, A 2 ), x) = 
dist(x, A 3 ). These are continuous functions of x and d\{x) +d 2 {x) + d 3 (x) > 0 
for all x. Define 

Ui = { x G S : di(x) < (l/4)(di(x) + d 2 (x) + d 3 (x)) } , 

U 2 = { x e S : d 2 {x) < (l/4)(di(x) + d 2 (x) + d 3 (x)) } , 

U 3 = { x G S : d 3 (x) < (l/4)(di(x) + d 2 {x) + d 3 (x)) } . 


Then Uj is open, Uj A Aj , and 


Uj C { x G S : dj(x) < (l/4)(di(x) + d 2 (x) + d 3 (x)) } 

for j = 1, 2, 3. We see that the intersection U\ 0 U 2 0 U 3 is empty by adding 
the three inequalities. □ 

Exercise 3.2.8. Formulate the separation theorem corresponding to n closed 
sets in a general metric space. 


Sum Theorem 

The sum theorem for dimension 1 is similar to the sum theorem for dimension 
0. First, the union of two closed sets. 
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Theorem 3.2.9. Let S be a metric space, and let F-[ , P 2 be closed subsets. If 
Cov.Fi < 1 and CovP 2 < 1, then Cov(Pl U P 2 ) < 1. 

Proof. Let A, B,C be closed sets in F\ U P 2 with A fl B fl C = 0. The three 
sets TdFi, BC\Fi , CTlPi are closed in F\, and their intersection is empty. So 
(Proposition 3.2.4) there exist closed sets K, P, M C F\ so that K A A n Fi, 
L A F?nFi, M A CnFi, KnLnM = 0, and KULUM = F±. The three sets 
(KUA) nF 2 , (PUP) flP 2 , (MUC) flP 2 are closed in P 2 , and their intersection 
is empty. So there exist closed sets P,Q,R C P 2 so that PA (K U A) fl F 2 , 
Q A (P U B) (~l F 2 , R A (M U C) n F 2 , P n Q n R = 0 and P U Q U R = F 2 . 
Define E = K UP, F = LUQ , G = MUR. Then E, F , G are closed in Pi UP 2 , 
E A A, F A B, G A c, E n F n G = 0, and E U F U G = Pi U P 2 . Therefore 
Cov(Pi U P 2 ) <1. □ 

We will see later (Theorem 3.4.11) that the union of two zero-dimensional 
sets has dimension at most 1, even if the sets are not closed. 

Exercise 3.2.10. Let n be a positive integer, let S' be a metric space, and let 
Pi, P 2 be closed subsets. If Cov F\ <n and Cov P 2 < n, then Cov(Pi U P 2 ) < 

n. 


Next, the union of a sequence of closed sets. 

Theorem 3.2.11. Let S be a metric space, and for each i £ N, let Fi C S be 
a closed set. If Cov Fi < 1 for all i, then Cov(J ieN Pj < 1. 

Proof. We may assume S = Cov(J- GN Pj. (So “open” will mean open in this 
set.) Let A,B,C be closed sets in S. By Proposition 3.2.7, there exist open 
sets Uq, Vq, Wq such that U 0 A A, Vo A B, Wq A C, and Uq fl Vq n Wq = 0. 

Beginning with these, we will recursively define open sets Ui,Vi,Wi such 
that 


Vi - 1 C Ui, V-! c Vi, W t - 1 c Wi, 
ulnvnWi = 0, FiCUiUViUWi, (*) 

where Fq = 0. Now (*) is satisfied for i = 0. Fix k > 0 and assume (*) is 
satisfied for all i < k. The three sets Uk-i fl Pfc, Vk- 1 C Pfe, Wfe_i fl Pfc are 
closed and have empty intersection. Since Cov Fk < 1 , by Proposition 3.2.4 
there are closed sets Kk,Lk,Mk C P fe with Kk A Uk-i Cl Pfe, Pfc A Vk- 1 flPfc, 
Mk A Wk-iCFk, KkCLkCiMk = 0, and FffcUPfcUMfc = Pfc. Now the three sets 
Uk-i U PTfc, Vfc_i U Pfc, Wfc_i U Mfc are closed and have empty intersection. 
So there are open sets Uk,Vk,Wk with Uk A Uk-i U Kk, Vk A Vk- 1 U Pfc, 
Wk A Wk -1 U Mk, Uk fl 14 fl Wk = 0. These sets satisfy (*) for i = k. This 
completes the recursive construction. 

Now define 


U=\JUi, V=U Vi, W=\JWi. 

ie N ieN ieN 
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These sets are open, and U^>A, VAB,WAC, and U ft V n W = 0. This 
completes the proof that Co v S <1. □ 

For the following exercise, the argument is like the one above. But this 
may test your ingenuity in finding a good notation for the proof. 

Exercise 3.2.12. Let n be a positive integer and let S' be a metric space. 
For each i e N, let Fi C S be a closed set. If Co v Fi < n for all i, then 
Cov Ui G N F i ^ n - 

Subset Theorem 

Theorem 3.2.13. Let S be a metric space, and T C S a subset. Then 
Cov T < Cov S. 

Proof. If Cov S = oo, there is nothing to prove, so assume Cov S = n for some 
integer n. We must show Cov T < n. This will be done in three stages. 

(1) First, assume T is closed. Let A be an open cover of T. Each element 
A £ A has the form A = E n T, where £ is a set open in S. Also, the 
complement S\T is open in S. Therefore 

A 1 = {ECS : E is open in S, E n T e A } U {5 \ T} 

is an open cover of S. So there is a refinement A 2 of A\ that is an open cover 
of S of order < n. So 


A 3 — { E D T '. E A 2 } 

is subordinate to A, an open cover of T, and of order < n. So CovT < n. 

(2) Now assume T is open. An open set T can be written as a countable 
union of closed subsets, T = (J jeN w here 

Fj = x G S : dist(x, S\T)> — 

Now by (1), Cov Fj < n for all j. So by the sum theorem 3.2.12 we have 
Cov T < n. 

(3) Finally, consider the general subset T of S. Let {Ui, E/ 2 , • • ■ , U n+ 2 } be 

an open cover of T. For each Ui, let Vi be a set open in S with Ui = Vi C I T. 
The set V = U "^ 2 ^ is open, so by (2), Cov V < n. Now {Vi, V 2 , • • • , V^+ 2 } 
is an open cover of V, so there exist open sets Wi C V with W* = 0 

and (J”=i 2 W 7 * = V. But then Wi flT are open in T, and they have intersection 
0 and union T. This shows CovT < n, as required. □ 
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Examples 

We saw in Theorem 3.1.4 that CovR > 1. But in fact, as expected, the 
dimension is exactly 1. 

Lemma 3.2.14. Let a < b be real. Then Cov[a,6] = 1. 

Proof. We must show Cov [a, b] < 1. Let e > 0 be given. Let n £ N be so large 
that 1 /n < e/2. Then 



is an open cover of [a, b] with mesh < e and order 1. 


□ 


Theorem 3.2.15. The line R has covering dimension 1. 

Proof. The line R is the union of the closed subsets [—n,n\ for n £ N. By 
Lemma 3.2.14, Cov[— n, n] = 1 for all n. Therefore by the Sum Theorem 3.2.11, 


Cov R < 1. 


□ 


Exercise 3.2.16. CovR 2 < 2. Figure 3.1.2 may be helpful. 

The fact that CovR 2 > 2 is more difficult to prove than CovR > 1. This 
will be done in Sect. 3.3. 

Exercise 3.2.17. CovR 3 < 3. 

The Sierpinski gasket is described on p. 8. 

Proposition 3.2.18. The Sierpinski gasket has covering dimension 1. 

Proof. I will use the notation S and Sk from p. 8. Each set Sk is made up of 
3 fc filled-in triangles with side 2~ k . For a given value of k, any two of these 
triangles either meet in a point, or else have distance at least 2 — fe / (x/3/2) 
between them. (The trema adjoining one side of the triangle is a triangle with 
side at least 2~ k and therefore altitude at least 2~ k /(\/3/2).) So if r > 0 is 
smaller than 2 _fe /v / 3, then the r-neighborhoods of these triangles are open 
sets which form a cover of S with order 1 and mesh at most 2 _fc + 2r. Therefore 
Cov S < 1. But S contains line segments, which have dimension 1, so Cov S > 
1 also. □ 

Exercise 3.2.19. Let d, n be positive integers. Let K be a nonempty compact 
set in R d . Let /i, fa, ■ ■ ■ ,/„ be similarities of R d to itself. Suppose K satisfies 
the self-referential equation 


n 


K=\JMK\, 


and that the set of images {fi[K\, f 2 [K\,- ■ ■ , f n [I\]} has order 1. Does it 
follow that Cov K <11 
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Exercise 3.2.20. Compute the covering dimension for other sets: the Heigh- 
way dragon (p. 20); the Koch curve (p. 19); the McWorter pentigree (p. 24); 
the twindragon (p. 33); the Eisenstein fractions (p. 34); 120-degree dragon 
(p. 23). 


Number Systems 

Recall the situation from Section 1.6. Let b be a real number, |6| > 1, and let 
D be a finite set of real numbers, including 0. We are interested in representing 
real numbers in the number system they define. 

Write W for the set of “whole numbers”; that is, numbers of the form 

M 

J2 a J bJ - (!) 

j = o 

Write F for the set of “fractions” ; that is numbers of the form 

-l 

Yh a ft- ( 2 ) 

j = - oo 

The set of all numbers represented by this system is the sum of one of each: 

M 

a 3 bJ - ( 3 ) 

i=-o o 

Exercise 1.6.3 may be considered a problem on topological dimension. 

Proposition 3.2.21. Let b be a real number with |6| > 1, and let D be a 
finite set of real numbers, including 0. Then either some real number has no 
expansion in the form (3) or some real number has more than one expansion 
in the form (3) . 

Proof. Suppose (for purposes of contradiction) that every real number has a 
unique expansion in the form (3). We will deduce that R. is zero-dimensional, 
a contradiction. 

The string space is compact (Exercise 2.6.8), and the map sending 
the infinite string of digits to the corresponding number (2) is continuous, so 
F is compact (Theorem 2.3.15). 

I claim that there is a minimum distance between elements of W. Suppose 
not: then choose x n ,y n £ W with x n yf y n and \x n — y n \ — » 0. By subtracting 
the places where they agree, we may assume that x n and y n have only zeros 
in places higher than the highest power of b where they disagree. Let M n 
be the highest power of b where the expansions of x n and y n disagree. Then 
x' n = x n b~ M ’ n ~ 1 and y' n = y n b~ Mn ~ 1 are elements of F, they differ in the 
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first place to the right of the radix point, and still \x' n — y' n \ — > 0. By taking 
subsequences, we may assume that the first place of x' n is the same for all n, 
the first place of y' n is the same for all n, (x' n ) converges, and (y' n ) converges. 
The limits x = lima;^ and y = lim y' n are equal, yet have different first places. 
This contradicts the uniqueness. So there is a minimum distance between 
elements of W. 

Now I claim that the set F is clopen. If not, there is a boundary point 
x. Now F is closed, so x £ F. There is a sequence in R \ F that converges 
to x. So there exist y n £ F and w n £ W with w n ^ 0 and y n + w n — > x. 
By compactness of F we may assume that (y n ) converges, say to y £ F. 
Then w n — > x — y. Since there is a minimum distance between elements of 
W, this means that w n = x — y for large enough n. So we have x = y + ui n , 
contradicting uniqueness of representations. 

Now consider an interval [a, b] in the line. Let e > 0 be given. Let r = 
e/ diarn F, so that rF has diameter e. Then 

(rF + rw ) 

w£W 

is a cover of [a, 6] by disjoint clopen sets with mesh e. Therefore by Theo- 
rem 3.2.2 Cov[a, b] = 0. This contradicts Corollary 3.1.5. □ 


3.3 *Two-Dimensional Euclidean Space 

In this section we will prove that, as expected, the covering dimension of the 
plane R 2 is 2. We will use this notation: 

D = { x € R 2 : |ir| < 1 } , the unit disk; 

S = { x € R 2 : |i| = 1 } , the unit circle. 

Degree (mod 2) 

The preliminaries to the discussion of the dimension of the plane involve a 
“homological” discussion of the degree of a map of the circle to itself, and the 
Brouwer fixed point theorem. 

Theorem 3.3.1. There is no continuous function f : D — > S with f(x) = x 
for all x £ S. 

Proof. If n is a positive integer, we define the subdivision A(n) of the circle to 
be the set {ao, a i, • • • , a„_ i} of points on the circle T, starting with ao = (1, 0), 
and continuing counterclockwise around the circle with equal spacing. So 


Optional section. 


100 3 Topological Dimension 


27 jj 2nj 

cos , sin 

n n 


When n is large, consecutive points <Zj_i and aj are close together. Sometimes 
we may write a n = ao- 

Given a continuous function g: S — > S, consider the image points g(ao), 
g(a±), • • • , g(a n ). They may not occur in order around the boundary of the cir- 
cle. Assuming that two consecutive points g{dj- 1 ) and g(cij) are close together 
(say, no more than 1/4 of the circumference apart), let Uj be the shorter of 
the two arcs of the circle with endpoints g{dj- 1 ) and g(dj). If g{a,j-\) = g(a,j), 
then Uj is just that single point. Now if y £ S is not one of the points g(a,j), 
we let N(A(n),g,y) denote the number of the intervals Uj that y is in. Let 
N(A(n),g,y ) be the residue* modulo 2 of N(A(n),g,y). 



Fig. 3.3.1. Illustration for Theorem 3.3.1 


I claim that the number ./V(A(n),y,y) is independent of the choice of y, 
as long as it is not one of the points g(dj). Indeed, think of moving y around 
the circle. As long as we do not cross one of the points g(cij ), the number 
N(A(n),g,y) remains unchanged. When we do cross one of the points g(a.j), 
the number N(A(n),g,y) may remain unchanged (if the two arcs Uj and 
Uj . |_i are on opposite sides of g(a,j )) or it may increase or decrease by 2 (if the 
two arcs Uj and Uj + \ are on the same side of g(aj)). It any case, the parity 
N(A(n),g,y) remains unchanged. 

Therefore we will write N(A(n),g) for the value of all N(A(n),g,y). 

Now let / : D — > S be continuous, and suppose that f(x) = x for all x £ S. 
For 0 < r < 1, let 

g r (x) = f(rx) for x £ T. 


There are only two possible values: 0 represents “even” and 1 represents “odd’.’ 
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Since D is compact, the function / is uniformly continuous. Therefore there 
is S > 0 so that \x — y\ < S implies \f{x) — f(y)\ < 1. If n is large enough, 
consecutive points aj of the subdivision A(n) are within distance S of each 
other. For this value of n, all of the functions g r satisfy the assumption required 
above that consecutive points g r (a,j_ i), g r {oij) have distance less than 1/4 of 
the circumference of the circle. Fix this value of n. 

Consider the numbers N(A(n),g r ). The function g\ is the identity on S, 
so clearly N(A(n),gi) = 1. The function go maps everything to the single 
point /( 0), so N(A(ri),go) = 0. Now I claim that both of the sets { r £ [0, 1] : 
N(A(ri),g r ) = 0} and {r S [0,1] : N(A(n),g r ) = 1} are open in [0,1]. Fix 
some value r 0 £ { r £ [0, 1] : N(A(n),g r ) = 0}. Choose a point y £ S not 
equal to any of the points g ro (a,j). There is a minimum distance e from y 
to the set {ffr 0 (ao): 9r 0 {ai), ■ • • , g ro (a„_i)}. By the uniform continuity of /, 
there is S > 0 so that if \r — r 0 | < S , then | g r (a,j) — g ro ( a j)\ < e for all 3- This 
means that y lies in exactly the same arcs defined by g r as arcs defined by 
g ro . So N(A(n),g r ) = N{A(ri),g ro ). Therefore Bg(ro) lies entirely in the set 
{r€ [0,1] : N(A(n),g r ) = 0 }. So it is an open set. 

Thus [0, 1] is the union of two disjoint nonempty subsets, both open in 
[0, 1]. This is impossible by Corollary 3.1.5. So the assumption that the func- 
tion / exists is not tenable. □ 

An easy consequence is Brouwer’s fixed point theorem (for the disk). We 
will use a calculation, which is left to the reader. You can think of it as a test 
of your analytic geometry and algebra skills. 

Exercise 3.3.2. Let (x,y) and (a,b) be distinct points in the disk D. Then 
the point (u, v) where the ray from (a, b) through (x, y ) intersects the circle 
is given by u = x + t(x — a) and v = y + t(y — b) 1 where 

— (a; — a)a; — (y — b)y + y/(x — a) 2 + (y — b ) 2 — ( ay — bx ) 2 
(x - a) 2 + (y- b) 2 ' 

Theorem 3.3.3 (Brouwer’s Fixed Point Theorem). If g: D — > D is 

continuous, then there is a point x £ D with g{x) = x. 

Proof. Suppose there is no such fixed point. Define a function / as follows. 
If x € D, then g{x) ^ x is also in D\ let f{x) be the point where the ray 
from g(x) through x intersects the circle S. By Exercise 3.3.2, we can see 
that the function / is continuous. Also, f(x) = x if x £ S. This contradicts 
Theorem 3.3.1. So in fact g has a fixed point. □ 

Topological Dimension of the Plane 

Now I will show that the plane R 2 has covering dimension 2. We already 
know (Exercise 3.2.16) that CovR 2 < 2. To show CovK 2 > 2 it is enough by 
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Fig. 3.3.2. Illustration for Exercise 3.3.2 


Theorem 3.2.13 to show that Cov M > 2 for a certain closed subset M of K 2 . 
The subset M will be a filled-in equilateral triangle, including the edges, the 
vertices, and the inside. 

Theorem 3.3.4. Let M be a filled-in equilateral triangle. Then Co vM > 2. 

Proof. Assume, for purposes of contradiction, that Co vM < 1. Since the 
triangle M is homeomorphic to the disk D, Brouwer’s fixed point theorem 
also holds in M. Label the vertices of the triangle a, b , c, and the edges A, B, C 
in such a way that A ft B = {c}, BC\C = {a}, and C D A = {b}. The three 
subsets A, B : C are closed, and A D B fl C = 0. By Proposition 3.2.3, there 
exist open sets U, V, W with U D A, VDB, WDC, UUVUWDM, and 

unvnw = 0 . 



Fig. 3.3.4. Illustration for Theorem 3.3.4 
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Define a function a: M — > M as follows. The distance dist(x, S \ U ) to 
the complement of U is a continuous function of the point x, because of the 
Lipschitz condition 

I dist(x, S\U) — dist (y, S \ U) I < \x — y\. 


Define 

a{x)= ifX ^ a ’ 
la, if x = a. 

The point a belongs to both B and C, and therefore to V fl W; so a ^ U and 
thus dist(x, S \ U) < \a — x\. So a{x) is a point on the line segment joining x 
and a. And therefore a is continuous even at the point a. But M is convex, 
so a(x) £ M. If x £ U, then dist(x, S\U) > 0, so we have a(x) y^ x. (Recall 
a ^ U.) On the other hand, if x £ U, then a{x) = x. 

Define corresponding functions: 


f3(x) 

7(x) 


x + dist |l-f J vV) (b-x), 


if x ^ b, 
if x = b, 


dist(a;,5\Ty) 

\c-x\ 


(c- x), 


if i/c, 
if x = c. 


These map M continuously into M. Also x £ V if and only if (3(x) ^ x; and 
x £ W if and only if y(x) yf x. Now define 

<p(x) = ^ (a(x) + (3(x) + 7 (x)) , 

the centroid of the three points a(x),/3(x), j(x). This is a continuous function 
of X, and maps M into M because M is convex. 

By the Brouwer fixed point theorem, ip has a fixed point in M, p(xq) = Xo- 
I claim that Xo ^ A. Indeed, if xq £ A C [7, then a(xo) is not on the edge 
A, and thus the centroid <p(x o) is not in A, so <p(x o) yf xo- Similarly, Xq ^ B 
and Xo ^ C. So Xo is strictly inside the triangle. Now consider which of 
the sets U, V, W the point Xo belongs to. What if Xo belongs to only one 
of the three sets? Say x 0 £ U, x 0 ^ V, xo ^ W. Then a(x 0 ) y^ x 0 but 
/?(xo) = xo, 7 (xo) = Xo, so <^(xo) y^ xo- What if xo belongs to exactly two 
of the sets U, V, W? Say xo £ U, xq £ V but xo ^ W. Then the three points 
a(xo), /3(xo), y(x) = xo are not collinear, so their centroid ip(x o) is not equal 
to a vertex xq. The only remaining cases (xo belongs to all three of the sets 
U, V , W or to none of them) contradict the way we obtained the sets to start 
with. So we have a contradiction. This shows that CovM < 1 is false. □ 


Exercise 3.3.5. Assume known the 3-dimensional Brouwer fixed point theo- 
rem: a continuous function from the closed ball in R 3 into itself has a fixed 
point. Use this to prove that the covering dimension of R 3 is 3. 
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3.4 Inductive Dimension 

Dimension zero was characterized by the existence of a clopen base for the 
topology. The (small) inductive dimension generalizes this. 

Hermann Weyl explained dimension like this:* “We say that space is 3- 
dimensional because the walls of a prison are 2-dimensional.” 

If we have a point that we want to imprison, we can use a small cube as 
the prison. By making the cube small enough, when the point is forbidden 
to move through the faces of the cube, it can be confined to a very small 
region. The cube consists of 6 plane faces; we need to know that they are 2- 
dimensional. A point living in these faces (Flatland) can be imprisoned using 
a small circle. So saying that the faces of the cube are 2-dimensional requires 
knowing that a circle is 1-dimensional. A point living in the circle (Lineland) 
can be imprisoned using just 2 points as prison walls. So we need to know 
that a 2 point set is O-dimensional. Finally, a point living in the 2 point set 
(Pointland) is already unable to move. So we need no prison walls at all. This 
will be the definition of a O-dimensional set. 

The Small Inductive Dimension 

This is defined in an inductive manner. Each metric space S will be assigned 
a dimension, written indS 1 , chosen from the set {— 1, 0, 1, 2, 3, • • • , oo}, con- 
sisting of the integers > — 1 together with an extra symbol oo. The empty 
metric space 0 has ind 0 = —1. If k is a nonnegative integer, then we say that 
ind S < k iff there is a base for the open sets of S consisting of sets U with! 
ind dU < k — 1. We say ind S' = k iff indS < k but indS ^ k — 1. Finally, 
if indS < k is false for all integers k , then we say indS = oo. The small 
inductive dimension is also known as the Urysohn-Menger dimension or 
weak inductive dimension. 

Note: by Proposition 3.1.11, if S is a separable metric space, then we have 
Cov S = 0 if and only if ind S = 0. This equivalence may fail for nonseparable 
spaces [55, Chap. 7, Sect. 4]. In fact, we will see later that if S is a compact 
metric space, then Cov S = ind S. 

Exercise 3.4.1. Show indR = 1. 

Since the small inductive dimension is defined inductively, it is often possible 
to prove things about it by induction. 

Theorem 3.4.2. Topological dimension is a topological property: If S and T 
are homeomorphic, then ind S = ind T . 


* I have to quote from memory, because I cannot seem to find this quotation now. 
t Recall the notation dUiov the boundary of U . 
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Proof. The proof is by induction on indS 1 . If indS 1 = — 1, then S is empty; 
since T is homeomorphic to S , it is also empty, so indT = — 1. 

Suppose the theorem is known for spaces S with indS < k , and consider 
a space S with indS = k + 1. Let h: S — > T be a homeomorphism. There is 
a base B for the open sets of S consisting of sets B with indclB < k. Now 
{ h[B] : B £ 1 3 } is a base for the open sets of T. If B £ B, then h[dB] = dh[B] . 
The restriction of h to dB is a homeomorphism. By the induction hypothesis, 
ind dh[B] = ind dB < k. So we see that there is a base for the open sets 
of T consisting of sets with boundary of dimension < k. This shows that 
indT < k + 1. But if indT < k , then the induction hypothesis would show 
ind S' < k , which is false. So indT = k + 1. Therefore, by induction we see 
that if ind S is an integer, then ind S = ind T. 

If indS = oo, then indT = k is false for all integers k, so also indT = oo. 

So in all cases we have ind S = ind T. □ 

Theorem 3.4.3. Let S be a metric space, and let T C S. Then indT < ind S. 

Proof. This is clear if ind S = oo. So suppose ind S < oo. The proof proceeds 
by induction on ind S. 

If indS = — 1, then S is empty, so clearly T C S is also empty, and hence 
ind T = -1. 

Suppose the theorem is true for all pairs S, T with T C S and indS < k. 
Consider a pair T C S with indS = k + 1. I must show that there is a base 
for the open sets of T consisting of sets with boundary of dimension < k. So 
let x £ T, and let V be an open set in T with x £ V. I must find an open set 
U in T with x £ U C V and ind drU < k. [Note that the boundary of a set 
in T may be different than the boundary of the same set in S; so the space 
is indicated as a subscript.] Now since V is open in T, there exists a set V 
open in S with V = V C\ T._Since indS 1 < k + 1, and x £V , there is a set 
U open in S with x £ U C V and inddsU < k. Let U = U (~1 2k Then U is 
open in T, and x £ U C V. Now by Theorem 2.2.20, drU C dsU , so by the 
induction hypothesis, we have ind DtU < ind dsU < k. Thus there is a base 
for the open sets of T consisting of sets U with ind drU < k. This means that 
indT < k + 1. 

Therefore, by induction, the theorem is true for all values of ind S. □ 

The triadic Cantor dust C is homeomorphic to the space {0, 1}^ of 
infinite strings based on the two-letter alphabet {0,1} (Proposition 2.6.3). 
Any metric space homeomorphic to these spaces may be called generically 
a “Cantor dust” or a “Cantor set’.’ The Cantor dust is a universal zero- 
dimensional space in the following sense: 

Theorem 3.4.4. Let S be a nonempty separable metric space. Then ind S = 0 
if and only if S is homeomorphic to a subset of the space {0,1}^. 
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Proof. Suppose S is homeomorphic to T C {0,1}^“^. By Theorem 3.4.2, 
ind S' = indT. By Theorem 3.4.3, indT < ind{0, 1}^. But ind{0, 1 }(A) = 0. 
Therefore ind S < 0. Since S ^ 0, we have ind S = 0. 

Conversely, suppose ind S' = 0. There is a base 23 1 for the open sets of S 
consisting of clopen sets. By Theorem 2.3.1, there is a countable 23 C still 
a base for the open sets. Write 23 = {U\, U 2 , • • • }. (If 23 is finite, repeat the 
basic sets over and over.) For notation, we will use Ui(l) = Ui and Ui(0) = 
S\ Uf, they are all clopen sets. If a £ {0, l} (fe \ say a = eie 2 • • • e k , let 
U(a) = Ui(ei) (~l U 2 (e 2 ) n • • • D U k (e k ). 

Define a map h: S — > {0, 1}^ as follows: given x £ S, the ith letter of h(x) 
is 0 or 1 according as x belongs to t/j(0) or Ui( 1). So this means h(x) £ [a] if 
and only if x £ U(a). Thus h _1 [[a]] = U(a). 

Now I claim h is one-to-one. Indeed, if x y, then S \ {y} is an open 
set containing x, so there is i with x £ Ui C S \ {y}, and therefore h(x) ^ 
h(y). This shows that h is one-to-one. So the inverse function /i -1 : h[S] — ► S 
exists. Now the sets [a] constitute a base for the open sets of {0,1}^), and 
/i -1 [[a]] = U(a) is open for every a, so h is continuous. Similarly, the sets Ui 
constitute a base for the open sets of 5, and 

h[Ui] = (J (h[S] n [al]) 
aefo.i}^- 1 ) 

is open in h[S] for every i. So h _1 is continuous. This means that h is a 
homeomorphism of S onto /ifS) C {0, 1}^. □ 

Exercise 3.4.5. Let S, T be metric spaces. Then ind(S l x T) < ind S + ind T. 

There are examples showing that strict inequality is possible [22, Exam- 
ple 1.5.17], 

The Large Inductive Dimension 

In Proposition 3.1.8, covering dimension zero was characterized in terms of a 
separation property. A generalization of this will be considered next. 

Let A and B be disjoint subsets of a metric space S. We say that a set 
L C S separates A and B iff there exist open sets U and V in S with 
U n V = 0, U D A, V A B, and L = S \ (U U V). (See Fig. 3.4.6.) So 
Proposition 3.1.8 says that in a zero-dimensional space S , any two disjoint 
closed sets are separated by the empty set. Note that a space S has ind S < k 
if and only if a point {x} and a closed set B not containing x can be separated 
by a set L with indL < k — 1. Indeed, there is a basic open set U with 
x £ U C S\B and L = dU separates {x} and B. 

The large inductive dimension is a topological dimension closely re- 
lated to the small inductive dimension. Each metric space S will be assigned 
an element of the set {— 1, 0, 1, 2, • • • , 00 }, called the large inductive dimen- 
sion of S, written Ind S. To begin, Ind0 = —1. Next, if k is a nonnegative 
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U 


Fig. 3.4.6. L separates A and B 

integer, we will say that Ind S < k iff any two disjoint closed sets in S can be 
separated by a set L with IndL < k — 1. We write Ind S' = k iff IndS < k 
but Ind S k — 1. We write Ind S = oo iff Ind S < k is false for all integers k. 
The large inductive dimension is also called the Cech dimension or strong 
inductive dimension. 

Exercise 3.4.7. Show IndM = 1. 

Exercise 3.4.8. Let S be a metric space. Then ind S < Ind S. 

In general metric spaces, the large and small inductive dimension are not 
necessarily equal. But in separable spaces they are equal. This will be proved 
below (Corollary 3.4.17.) 

Note that Proposition 3.1.8 now has the formulation: Cov S = 0 if and only 
if IndS = 0. Proposition 3.1.11 says that if S is separable, then Cov S = 0 
if and only if ind S = 0. So we conclude that all three definitions of “zero- 
dimensional” coincide for separable metric spaces. 

Sum Theorems and Separation 

Lemma 3.4.9. Let S be a metric space, let A and B be disjoint closed sets, 
and let T C S. (a) Let U and V be open sets with U D A, V D B , and 
U C\V = 0. If L' C T separates (in T) the sets T C\U and T (~l V , then there 
is a set ICS that separates A and B in S with L (~l T C L' . (b) Suppose, 
in addition, that T is closed. Then, for any set L' C T separating T (~l A and 
T (~l B in T, there is L C S separating A and B in S with LnT C L' . 

Proof, (a) Since L' separates T fl U and T n V, we have T\L' = U' U V 7 , 
with U' and V' open in T, T fl U C U' , and T fl V C V' . Now I claim that 
AnV 7 = 0. Indeed, U d V' = T n U C V' C U' C V = 0, and U is open, so 
U fl V' = 0 and thus AnV' = 0. Similarly, B C\U' = 0. Now U 1 and V' are 
disjoint and open in U' U V' , so U' fl V' = 0 and U' fl V' = 0. But then 

(A U U') C(BU V') = (Auu')n(Buv 7 ) 

= (4nB)u(AnF)u {W n b) u {u' n F 7 ) 

= 0, 
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and similarly ( A U U') n(BU V') = 0. So (by 2.2.30(1)) there exist disjoint 
open sets U" and V" with A U U' C U" and B U V' C V" . Then let L = 
S \ (U" U V”). It separates A and B. Also 

T n L = T \ (TJ" U V") 

CT \ (XT' U V') 

= L' . 

(b) Since L' separates TD A and TCiB in T, there are disjoint sets U' and 
V', open in T, with T 0 A C U' , T D B C V', and T\(U'U V ') = L' . Now 
the sets A and (T \ U') U B are disjoint and closed, so there exists an open 
set U" with 

A C U" C TF C S \ ((T \ U') U B) . 

Similarly, B and (T \ V') U U" are disjoint and closed, so there exists an open 
set V" with 

B C V" C V" C S\ {{T\V') \JU"). 

So A C U", B C V " , and U" D V" = 0. Then apply part (a). □ 

This enables us to prove a generalization of Proposition 3.1.8. 

Corollary 3.4.10. Let S be a separable metric space , let A and B be disjoint 
closed sets in S, and let T C S be a subset with indT = 0. Then there is a set 
L that separates A and B in S such that L CiT = 0. 

Proof. Choose U and V open sets with A C [7, B C V , and U fl V = 0. The 
sets U fl T and V D T are disjoint and closed in T, which is zero-dimensional, 
so they can be separated by L' = 0. Apply Lemma 3.4.9 to get L separating 
A and B with L fl T C L' = 0. □ 

Here is a sum theorem for sets that are not necessarily closed. 

Theorem 3.4.11. Let S be a separable metric space , and let A, B C S . Then 
ind(A U B) < 1 T ind A + ind B. 

Proof. If either ind A = oo or indH = oo, then the inequality is clear. So 
suppose they are both finite, say ind A = m, ind B = n. The proof proceeds 
by induction on the sum m + n. 

For n + m = —2, both A and B are empty, so A U B is also empty, and 
ind (A U B) = -1 = 1 + (-1) + (-1). 

So assume that the result is known for smaller sums. Let x £ A U B. 
Then either x £ A or x £ B. Suppose x £ A. Let V be open in A U B with 
x £ V. Then the two sets {x} and A \ V are separated in A by a set L' with 
indL' < m — 1. So by Lemma 3.4.9, there is a set L that separates {x} and 
(A U B) \ V in A U B , and L D A C L' . Now L = (L n A) U (L D B), so by 
the induction hypothesis, ind L < 1 + (m — 1) + n = m + n. Therefore I have 
shown that ind(A U B) < m + n + 1, as required. □ 
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Corollary 3.4.12. The union of n + 1 zero- dimensional sets in a separable 
metric space has small inductive dimension < n. 

Next we prove the sum theorem for higher dimensions: 

Theorem 3.4.13. Let k be a nonnegative integer , and let S be a separable 
metric space. Let closed sets T n C S satisfy indT„ < k for n = 1, 2, ■ • • . Then 

ind U„ GN T ™ < k - 

Proof. Write T = U„ g n^- T de resu lt is clearly true for k = oo. So suppose 
k is finite. The proof is by induction on k. The case k = 0 has been done 
already. Suppose k > 1, and assume the result is known for smaller values. 

For each n, let B n be a base for the open sets of T n consisting of sets with 
boundary of dimension < k. By the Lindelof property, we may assume that 
the bases B n are countable. For all n and all U £ B n , we have ind dr n U < k— 1. 
By the induction hypothesis, the countable union 

y =U U (w) 

n G N (7es„ 

also has ind Y < k — 1. But the space Z n = T n \Y has the family 

{ U \ Y : U £ B n } 

as a base for its open sets, and the sets U\Y are clopen in Z n . So ind Z n < 0. 
Now consider the union 

Z = (J Z n . 

ngN 

Each Z n = T n \Y = T n D Z is closed in Z, so ind Z < 0. Thus (by Theo- 
rem 3.4.11) 

indT = ind(FUZ) < 1 + (k - 1) +0 = k. □ 

Examination of the proof yields the converse of Corollary 3.4.12: 

Corollary 3.4.14. Let S be a separable metric space with finite small induc- 
tive dimension k. Then S is the union of k + 1 zero- dimensional sets. 

Theorem 3.4.15. Let S be a separable metric space, let A and B be disjoint 
closed subsets. Let k > 0 be an integer, and let T C S have small inductive 
dimension k. Then there is a set L separating A and B with ind(TnL) < k— 1. 

Proof. The case k = 0 has already been done (Corollary 3.4.10). So suppose 
k > 1. By Corollary 3.4.14, we can write T = Y U Z, with indF = k — 1 and 
ind Z = 0. Now (again by Corollary 3.4.10) there is a set L separating A and 
B with L H Z = 0. Then LflTCy. So ind(T OT) < ind Y < k — 1. □ 

Another consequence is this separation theorem, which generalizes Propo- 
sition 3.1.8 in another way: 
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Corollary 3.4.16. Suppose S is a separable metric space with indS < n— 1. 
Let A\, Bi, A 2 , B 2 , ■ • ■ , A n , B n be 2 n closed sets in S with AiH Bi = 0 for all 
i. Then there exist sets Li,L 2 ,-- ■ ,L n such that Li separates Ai and Bi for 
all i, and the intersection nr=i ^ = 0 . 

Proof. First, there is a set L\ that separates A\ and B±, such that indLi < 
n— 2. Applying the theorem with T = L 1 , we get a set L 2 that separates A 2 
and B 2 with ind(L 1 nL 2 ) < n—3. Continuing in this way, we get L 1; L 2 , • • • , L n 
with ind(Li n L 2 D • • • n L n ) < — 1, so it is empty. □ 


ind = Ind = Cov 


It is true for general metric spaces S that ind S' < IndS = CovS, and for 
separable metric spaces S that Cov S = ind S = Ind S. (See, for example, [35, 
Chap. V] or [22, Theorems 4.1.3, 4.1.5] or [22, Theorem 4.1.5]. An example 
with ind S = 0, Ind S = Cov S = 1 is described in [55, Chap. 7, Sect. 4] .) I will 
prove only this much: For separable S, CovS < indS = IndS; for compact 
S, Cov S = ind S = IndS. 

Proposition 3.4.17. Let S be a separable metric space. Then IndS < indS. 

Proof. This is clear if indS = 00 . So suppose indS = k is finite. The case 
k = 0 is from Theorems 3.1.11 and 3.1.8. Suppose k > 1 and the result is 
known for smaller values of the dimension. Let A and B be disjoint closed 
sets in S. Then by Theorem 3.4.15, A and B can be separated by a set L with 
indL < k — 1. But by the induction hypothesis, IndL < k — 1. Therefore we 
have Ind S < k. □ 

Theorem 3.4.18. Let S be a separable metric space. Then CovS < indS. 

Proof. The result is clearly true if indS = 00 . So suppose we have indS < 
00 . Let n = indS. I must show CovS < n. Now by Corollary 3.4.14, 
there exist zero-dimensional sets Y[ , Y 2 , • • • , Y n+ i with S = US^-- Let 
{ C/i , t/ 2 , • • • ,Uk} be a finite open cover of S. Then for each j , the family 
{Z7i nY j ,U 2 nY j ,--- , Uk n Yj} is a finite open cover of Yj. Now ind Y, = 0, 
so CovYj = 0, and therefore there exist disjoint open sets Bij C Ui with 
Ut=i 2 Yj. Now the family 

23 = { B^ : l<i<fc, l<j<n + l} 

covers S and refines {C/i , ■ ■ - ,Uk}- If we have any n + 2 elements B t j of 

23 , some two of them have the same second index j , so the intersection of the 
n + 2 sets is empty. Therefore 23 has order < n. This completes the proof 
that Cov S < n. □ 

The second part of the equality (for compact spaces) of the covering di- 
mension with the other topological dimensions will be proved in a way that 
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also establishes another useful characterization of topological dimension for 
compact spaces. A function h: W — * S is said to be at most n-to-one iff for 
each x £ S there are at most n points y £ W with h(y) = x. 

Theorem 3.4.19. Let S be a compact metric space. Let n > 0 be an integer. 
Then the following are equivalent: 

(1) ind,? < n. 

(2) Cov S < n. 

(3) There is a compact zero- dimensional metric space W and a continuous 
map h of W onto S that is at most ( n + 1) -to- one. 

Proof. (1) => (2) is Theorem 3.4.18. 

(2) => (3). Begin with the cover ho = {5}, and (recursively) choose finite 
open covers Uk of order < n such that the mesh of Uk is less than half of 
the Lebesgue number of Uk-i- We can use this sequence of covers to define a 
tree: the nodes of the /cth generation are the elements of Uk', for each U £ Uk, 
there is V £ Uk-i with U C 1/; let V be the parent of U. (If there is more 
than one such set V, choose one.) If we label the edges of the tree (an edge 
goes from the parent to the child) by a countable set E , then the set of nodes 
of the tree is in one-to-one correspondence with the set El of finite paths 
in the tree starting at the root z. Write U a for the open set corresponding to 
the path a £ E^\ Thus, the cover Ufc is { U a : a £ E^> }. Let W = be 
the set of infinite paths starting at the root. As usual, W is an ultrametric 
space under the metric £> 1 / 2 , and therefore indW = 0. Also W is compact, 
since each node has only finitely many children. 

Define the map h: W — > S as follows: If 0 £ Eg is an infinite path, the 
sets U a \k are compact, U a \$ D U a \\ D ■ ■ ■ , and limfc^oo diamt/o-ffe = 0. The 
set 

0 U a \k = ("') U a \k 
fee n fceN 

is nonempty (by compactness) and has diameter 0, so it consists of a single 
point. Let that point be h(o). Now /i[[cr|A]] C U„\k, so h is continuous. Since 
the families Uk cover S, we may deduce that h is surjective. 

Finally, I must show that the map is at most (n + l)-to-one. Suppose ay, 
CT 2 , • • • , o’n +2 £ W are all different. Then there is a generation k so that 

o 1 \k,a 2 \k, - ■ ■ ,cr n+2 \k 

are all different. But Uk has order at most n, so 

n + 2 

pi u ailk = 0. 

i = 1 

Therefore not all of the points h{(ii) are equal. 
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(3) => (1). The proof is by induction on n. The case n = 0 and the 
induction step for n > 1 begin in the same way, however. 

Let x £ S and let e > 0. I must show that there is an open set V C S 
such that x £ V C B e (x) and indcW < n — 1. The set D = h~ l [{x}] has at 
most n+ 1 elements, say D = {zi, z%, • ■ ■ ,z m }. The function h is continuous 
and W is zero-dimensional, so there exist clopen sets Ui C W with z-i £ Ui 
and h[Ui\ C B e (x). Let F = [J™ 2 h[Ui\. Now the sets Ui are clopen and W 
is compact, so the set Ui are compact, and thus F is compact and therefore 
closed. Let V be the interior of F, that is, the set of all the interior points of 
F. Then V is an open set. 

First, I claim that x £ V, that is, x is an interior point of F. Suppose 
not. Then there is a sequence (x*) in S \ F with Xk — > x. Choose points 
yk £ W with h{yk) = Xk- Taking a subsequence, we may assume that (yk) 
converges. Its limit y satisfies h(y) = x, so in fact y = Zi for some i. Now 2 ,; is 
an interior point of Ui , so yk £ Ui for some k, and therefore Xk = h(jjk) £ F. 
This contradiction shows that x is an interior point of F. 

Note that dV C F \ V. Now consider the subset 

Wi = h-'idv] \ <Jh U U 2 U •• • U U m ) 

of W. It is closed, hence compact and zero-dimensional. I claim that h maps 
Wi onto dV. Indeed, if y £ dV, then there is a sequence (yk) in S\F with 
yk —■ > y ■ Choose Wk £ W with h(u>k) = yk\ by taking a subsequence, we 
may assume that Wk — > w for some w £ W. Of course, h(w) = y. Now 
Wk & U\ U U 2 U • • • U U m , so w U\ U U 2 U • • • U U m , and therefore y £ W\. 

Next I claim that ind dV < n — 1. We must distinguish the cases n — 0 
and n > 1. First, suppose n = 0. Then h is one-to-one. Now dV C F and 
W\ C h _1 [F] \ U\ = 0. Therefore dV = 0, or indcIF = — 1 = n — 1. Next 
suppose n > 1 and the result is known for smaller values of n. If y £ dV, then 
since dV C F, the map h sends at least one point of U± U U 2 U • • • U U m onto y, 
so at most n points of W\ are sent onto y. Thus the restriction h : W\ — > dV 
satisfies the hypotheses of the theorem for n — 1. By the induction hypothesis, 
ind dV < n — 1. 

This completes the proof that ind S < n. □ 

In the corollary, the condition in (b) is called “<j-compact” and the condi- 
tion in (c) is called “locally compact’.’ 

Corollary 3.4.20. (a) If S is a compact metric space , then Cov S = ind S = 
Ind S', (b) If S is a metric space and S is the union of countably many compact 
subsets, then Cov S' = indS = IndS. (c) If S is a separable metric space, and 
every point of S is in the interior of some compact subset, then Cov S = 
ind S = Ind S. 

Proof, (a) indS = IndS by Exercise 3.4.8 and Proposition 3.4.17; indS = 
Co vS by Theorem 3.4.19. 
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(b) Now S = Un.eN Fn where F n is compact. A compact metric space is 
separable, so S is separable. As before, ind S' = IndS by Exercise 3.4.8 and 
Proposition 3.4.17; Co v S < indS by Theorem 3.4.18. So it remains to prove 
Cov S > ind S. There is nothing to do if Cov S = oo. So assume Cov S = k 
for some integer k. We will deduce that ind S' < k. Since F n C S, we have 
Co v F n < k by the subset theorem 3.2.13. So indify < k by (a). Compact 
subsets are closed, so indS < k by the sum theorem 3.4.13. 

(c) The collection of all open sets V such that V is compact is an open 

cover of S. By the Lindelof property, there is a countable subcover. So S 
satisfies (b) . □ 

Although the result Cov S = ind S = Ind S is true for any separable metric 
space, we have not proved it here. There are cases not covered by this corollary. 
For example, it can be proved that the separable metric space R\Q of irrational 
numbers satisfies none of the conditions (a), (b), (c). But it does satisfy ind = 
Ind = Cov = 0. 


3.5 *Remarks 

The classical reference on topological dimension is the book by W. Hurewicz 
and H. Wallman [35]. For this chapter, I have used that reference, as well 
as [22, 53, 54, 55]. Modern topological dimension theory deals with “topolog- 
ical spaces” and not just metric spaces. Topologists usually write “dim” for 
the covering dimension. But in this book, that is reserved for the Hausdorff 
dimension. 

Warning: The definition of “order” for a family of sets is not the same in 
all references. In some texts it is adjusted by 1. The definition used in this 
book (and many others) has the advantage that dimension n corresponds to 
covers of order n. But the disadvantage that “family of order n” means that 
at the sets meet at most n+ 1 at a time. The alternative definition (also used 
in many books) has the advantage that “family of order n” means that at 
the sets meet at most n at a time, but the disadvantage that dimension n 
corresponds to covers of order n + 1. 

The small inductive dimension uses the catch-all symbol “oo” for spaces 
that do not have small inductive dimension otherwise specified by the defini- 
tion. There is the possibility of a more refined classification of metric spaces 
of infinite dimension. The way in which the definition is formulated makes the 
use of transfinite ordinal numbers quite natural. If a is an ordinal number, and 
S is a metric space, then we say that ind S < a iff there is a base 23 for the open 
sets of S such that indd?7 < a for U £ 23. Questions: Which ordinals a are 
the dimension of some metric space? Separable space? Compact space? Do we 
still need an extra symbol oo, or does every metric space admit some ordinal 
as its dimension? What happens to the formula ind(AuB) < ind A + ind B + 1 
(when addition is not commutative)? References: [22, p. 50], [35, p. 83]. 
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The proof in Sect. 3.3 that CovK 2 = 2 is a small taste of the mathematical 
field known as algebraic topology. A ten-cent description of algebraic topol- 
ogy might say that an algebraic object is associated with a situation from 
topology, in such a way that useful information can be obtained about the 
topology from the algebra. In the proof given in Sect. 3.3, the algebraic object 
that is used is the integers modulo 2. This is not a very sophisticated algebraic 
object, but it is enough to distinguish between two kinds of functions from 
the circle to itself. 

Hints for some exercises. 

Exercise 3.1.12: The intervals (a, b) HQ with irrational a, b form a base for 
the topology of Q. 

Exercise 3.1.13: Show there is a positive minimum distance between the 
sets fi[K ], and remember the proof that the Cantor set is zero-dimensional. 

Exercise 3.2.17: Define a tiling of JR 3 by congruent rectangular solids that 
meet at most 4 at a time. Do it by “layers” where each layer looks like 
Fig 3.1.2, but adjacent layers are offset with respect to each other. 

Exercise 3.2.20: Heighway dragon, 2; Koch curve, 1; the McWorter penti- 
gree, 1; the twindragon, 2; the Eisenstein fractions, 2; 120-degree dragon, 2. 

Exercise 3.4.1: Use the usual base of open intervals. 

Exercise 3.4.5: d(U x V) = ( dU x V) U (U x dV ). 

Exercise 3.4.7: If A, B are disjoint nonempty closed sets, then they are sep- 
arated by L = { x £ R. : dist(a;, A) = dist(a;, B) }. Show L is zero-dimensional. 


A line is breadthless length. 
A surface is that which has length and breadth only. 
A solid is that which has length, breadth, and depth. 
— Euclid, The Elements (translation by T. L. Heath) 
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Self-Similarity 


There are several variant notions of “dimension” that may be classified as 
fractal dimensions. Among the most widely used of these are the Hausdorff 
dimension, the packing dimension, and the box dimension. They will be con- 
sidered in Chap. 6. We begin here with the similarity dimension , a fractal 
dimension that is easier to define (but not as useful). 

At the same time, we will formally discuss iterated function systems. 
This is an efficient way of specifying many of the sets that we will be interested 
in. It has been publicized in recent years by Michael Barnsley; see [3], [4]. 


4.1 Ratio Lists 

A ratio list is a finite list of positive numbers, (r'i , r 2 . • • • ,r n ). An iterated 
function system realizing a ratio list (ri, r 2 , • ■ • , r n ) in a metric space S is 
a list (/i, / 2 , ■ ■ ■ , /„), where /,; : S — > S is a, similarity with ratio r^. 

A nonempty compact set K C S is an invariant set or attractor for the 
iterated function system (/i, / 2 , •••,/„) iff AT = fi[K] U f 2 [K\ U • • • U f n [K\. 
The reason for the term “attractor” is in Corollary 4.1.4. We may sometimes 
say set attractor to distinguish it from other types of attractors that also 
occur. 

Examples: The triadic Cantor dust (p. 6) is an invariant setfor an iter- 
ated function system realizing the ratio list (1/3, 1/3). The Sierpinski gasket 
(p. 9) is an invariant set for an iterated function system realizing the ratio list 
(1/2, 1/2, 1/2). 

The sim-value (or similarity value) of a ratio list (ri, r 2 , ■ ■ • , r n ) is the 
positive number s such that r| + + • • • + r® = 1. 

Theorem 4.1.1. Let (ri, r 2 , • • • , r n ) be a ratio list. Suppose each r* < 1. Then 
there is a unique nonnegative number s satisfying Y^i=i r t ~ 1- number 
s is 0 if and only if n = 1. 
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Proof. Consider the function <P: [0,oo) — > [0,oo) defined by 

n 

®( S ) = J2 r i- 

i = 1 

Then <P is a continuous function, <£(0) = n > 1 and Hindoo <£(s) = 0 < 1. 
Therefore, by the intermediate value theorem, there is at least one value s 
with <2>(s) = 1. The derivative of <P is 

n 

^2 r t log W 

i — 1 

This is < 0, so # is strictly decreasing. Therefore there is only one solution s 
to <£(s) = 1. If n > 1, then ^(0) > 1, so s yf 0. □ 

A ratio list (rq, r%, ■ ■ • , r n ) is called contracting (or hyperbolic) iff A: < 1 
for all i. Similarly an iterated function system may be called contracting or 
hyperbolic. 

The number s is called the similarity dimension of a (nonempty com- 
pact) set K iff K satisfies a self-referential equation of the type 

n 

K = U 

i—1 

where (/i, / 2 , • • • , f n ) is a hyperbolic iterated function system of similarities 
whose ratio list has sim- value s. 

It is, of course, conceivable that a given set admits two different decom- 
positions, and therefore two different similarity dimensions. So the value s is 
more correctly considered to be a characteristic of an iterated function system 
(the sim-value) rather than of the set attractor K. We will see later that, un- 
der the right circumstances, the similarity dimension of a set K coincides with 
the Hausdorff dimension of K, which is uniquely determined by the set K . 

Consider, for example, an interval [a, 6]. It is the union of two smaller 
intervals, [a, (a + b)/ 2] and [(a + 6)/2, b\. Each of the parts is similar to the 
whole set [a, 6], with ratio 1/2. The sim-value of the ratio list (1/2, 1/2) is the 
solution s of the equation 



so s = 1. The similarity dimension is 1. 

Note, however, that we can also write [0, 1] = [0, 2/3] U [1/3, 1], correspond- 
ing to ratio list (2/3, 2/3), yielding a dimension larger than 1. So in order for 
the similarity dimension to be a characteristic of the set, we will need to limit 
overlap in some way. This is discussed in Sect. 6.4. 

What do we want to do for a rectangle [a, b] x [c, d] in R 2 ? It is the union 
of four rectangles with sides half the size. The ratio list is (1/2, 1/2, 1/2, 1/2) 
and the similarity dimension is 2. 
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How about a closed ball B r (a) in 1R 2 (a “disk”)? Some geometry shows 
that a disk is not the union of finitely many disks of smaller radius. This 
illustrates the main drawback of the similarity dimension: it is not defined for 
many sets, even very simple sets. 

Now let us consider a more interesting example. The triadic Cantor dust is 
the invariant set for an iterated function system realizing ratio list (1/3, 1/3) 
(p. 6). So the similarity dimension is the solution s of the equation 



so* s = log 2/ log 3, or approximately 0.6309. This agrees with the assertion 
(p. 2) that the dimension should be less than 1. 

Next take the Sierpinski gasket. The ratio list is (1/2, 1/2, 1/2), so the 
similarity dimension is log 3/ log 2 ss 1.585. It was asserted (p. 9) that the 
dimension should be between 1 and 2. 

Exercise 4.1.2. Calculate similarity dimensions for other sets: the Heighway 
dragon (p. 20); the Koch curve (p. 19); the McWorter pentigree (p. 24); the 
twindragon (p. 33); the Eisenstein fractions (p. 34); 120-degree dragon (p. 23). 

In these examples, we begin with a set, and then try to find a corresponding 
iterated function system. It is often useful to do things the other way around: 
begin with an iterated function system, and use it to obtain a set. 

Theorem 4.1.3. Let S be a nonempty complete metric space, let (ri, • • • , r n ) 
be a contracting ratio list, and let (/i, • • • , /„) be an iterated function system 
of similarities in S that realizes the ratio list. Then there is a unique nonempty 
compact invariant set for the iterated function system. 

Proof. Consider the hyperspace H(<S) of nonempty compact subsets of S, with 
the Hausdorff metric D. Since S is complete, so is H(S'), by Theorem 2.5.3. 
Define a function F : H/S) — > H(5) as follows: 

n 

f(a) = u m. 

i—i 

The continuous image of a compact set is compact (Theorem 2.3.15), and the 
union of finitely many compact sets is compact (Exercise 2.3.14); thus if A is 
compact, then so is F(A). 

I claim that F is a contraction map. Let r = max/?"!, r%, ■ ■ ■ , r n }. Clearly 
r < 1. I will show that 


D(F(A),F(B)) <rD(A,B). 

* Since I am a mathematician, when I write “log” it refers to the natural logarithm. 
But in fact for the quotient of two logarithms, as we have here, it doesn’t matter 
what the base is. 
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Let q > D{A,B) be given. If x is any element of F(A), then x = 
for some i and some x' £ A. Since q > D(A,B), there is a point y' £ B 
with g(x',y r ) < q. But then the point y = fi(y') £ F(B) satisfies g(x,y) = 
rig(x',y') < rq. This is true for all x £ F(A), so F(A) is contained in the rq- 
neighborhood of F(B). Similarly, F(B) is contained in the /'(/-neighborhood 
of F(A). Therefore D (F (A) , F (B)') < rq. This is true for every q > D(A,B), 
so we have D(F{A), F{Bf) < rD(A,B). 

Therefore we have a contraction map F defined on a complete metric 
space H(S'). By the contraction mapping theorem (2.2.21), F has a unique 
fixed point. A fixed point of F is exactly the same thing as an invariant set 
for (/i, / 2 , - - ■ ,fn)- □ 

This proof, together with Corollary 2.2.22, provides a construction for the 
invariant set: 

Corollary 4.1.4. In the notation of Theorem 1.3, if A 0 is any nonempty 
compact set in S, and if 

n 

Ak + i = [J fi[Ak] 

i—1 

for k > 0, then the sequence (A k ) converges in the Hausdorff metric to the 
invariant set of the iterated function system. 

This illustrates the use of the term attractor. 

Exercise 4.1.5. Let (ri,r 2 ,-- - , r n ) be a contracting ratio list. Suppose an 
iterated function system (/i, / 2 , ■ • • , /„) consists not of similarities, but only 
of maps /,; : S — > S satisfying 

Q(fi(x),fi(y)) < n g(x, y). 

Show that if S is complete, then there is a unique invariant set. 

We have been talking about “iterated function systems’.’ What does this 
have to do with “iteration”? Let (f\,f 2 ,fz) be the iterated function system 
associated with the Sierpinski gasket (p. 9). The iteration that we will be 
interested in involves starting with any point a £ R 2 , and then repeatedly 
applying these three functions, in any order. 

Exercise 4.1.6. Let fci, /c 2 , /%, ■ ■ • be an infinite sequence in the set {1, 2, 3}. 
Let a £ l 2 be a point. Let the sequence (x n ) be defined by 

x 0 = a; x n = f kri (x n _ i) for n > 1. 

Then: (1) Every cluster point of the sequence (x n ) belongs to the Sierpinski 
gasket 5; (2) Every point of the Sierpinski gasket is a cluster point of such a 
sequence (x n ) for some choice of kp, (3) There is a point a and choice sequence 
ki so that S is exactly equal to the set of all cluster points of (x n ). 
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A more sophisticated version of (3) says that a “random” choice of (Ay) 
will (with probability one) have cluster set S. This fact has sometimes been 
used to produce a picture of the invariant set for an iterated function system 
on a computer. (See “the chaos game” in [3].) 

Exercise 4.1.7. Let S' be a complete metric space, let (ri,r 2 , — - ,r n ) be a 
contracting ratio list, and let (/i, fi, ■ ■ ■ , f n ) be a realization of the ratio list in 
S. State and prove the appropriate version of Exercise 4.1.6 for this situation. 

Recall the situation from Sect. 1.6: Let b be a complex number, \b\ > 1, 
and let D = {di, g? 2 , • ■ • , dk} be a finite set of complex numbers, including 0. 
We are interested in representing complex numbers (or real numbers) in the 
number system they define. 

Write F for the set of “fractions”; that is, numbers of the form 

-l 

Y a 3 bJ - 

j=-o o 

Analysis of the set F is a “self-similar” set problem: 

Exercise 4.1.8. The set F is nonempty and compact, and is the invariant set 
for an iterated function system of similarities. What is the sim-value of this 
iterated function system? 

There are various ways that such a “number system” can be generalized. 
Consider the following way to define a set in R d . Let b £ R. with \b\ > 1 as 
before, but let D be a finite subset of R d , including 0. Then let F be the set 
of all vectors 

-l 

Y b ° a 3' 

j=—oo 

where ay £ D. Is there an iterated function system to describe FI 

The Menger sponge is a set in R 3 . (See Fig. 4.1.9.) Begin with a cube 
(filled in) of side 1. Subdivide it into 27 smaller cubes by trisecting the edges. 



Fig. 4.1.9. Menger sponge 
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The trema to remove consists of the center cube and the 6 cubes in the centers 
of the faces. That means 20 cubes remain. (The boundaries of these 20 cubes 
must also remain, so that the set will be compact.) Continue in the same way 
with the small cubes. 

Exercise 4.1.10. What is the topological dimension of the Menger sponge? 
What is the similarity dimension of the Menger sponge? 


4.2 String Models 

We will now consider “string models” in general. Given any contracting ratio 
list, there is a realization that is, in an appropriate sense, the “best” realization 
(Theorem 4.2.3). Any other realization of the same ratio list is an “image” of 
this one. One advantage will be seen in Chap. 6 when other fractal dimensions 
are computed. 

Two instances of the string model have already been seen in Chap. 1, 
where the space of infinite strings in the alphabet {0, 1} was used as a model 
for [0, 1] and for the Cantor dust. 


Example 

Before we consider the general model, we will take one more example, the 
Sierpinski gasket. This time we will need to use an infinite ternary tree. 
Each node has exactly three children; each node except the root has exactly 
one parent, and every node is a descendant of the root. To represent this using 
strings, we will need a three-letter alphabet. Any three letters will do; I will 
use this alphabet: E = { L, U, R}. (When I chose the letters I was looking at 
Fig. 1.2.1, and thinking of the words “left” “upper” “right’.’ ) 

We will write (as before) E ^ for the set of all fc-letter strings from this 
alphabet; and E W for the set of all finite strings; and E^ for the set of all 
infinite strings. We may identify E^*' with the infinite ternary tree: the empty 
string A is the root, and if a is a string, then qL is the left child, aU is the 
middle child (perhaps also called the “upper child”), q-R is the right child. 

We want to define a map h : E^ u > —* R 2 with range equal to the Sierpinski 
gasket S. The gasket itself is a union of three parts, the “left” part, the “upper” 
part, and the “right” part. There are three dilations of R 2 corresponding 
to these three parts. They will be called now: /|_, /u, /r. The “addressing 
function” or “model map” h: E^ — ■» R 2 should be defined so that h[E^] = 
S. It should be continuous and satisfy h(La) = f\_{h(o )), h(\Ja) = fu(h(cr)), 
h( R<j) = /r(/i(ct)) for all strings a. 

We will describe this addressing function h using base 2 expansions in the 
(u, v ) coordinate system of Exercise 1.2.4. A string from the alphabet {L, U, R} 
maps to a pair (u,v), according to the rules in the following table: 
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letter 

L 

U 

R 


digit of u 
0 
0 
1 


digit of v 
0 
1 
0 


For example, 

/i(LRLUU ■■■)= ((0.01000 • • • ) 2 , (0.00011 ■ ■ • ) 2 ). 


By Exercise 1.2.4, the range of h is exactly the Sierpinski gasket S. 

The metric to be used on E^' 1 will be called £> 1/2 ■ If er ,r€ then 

Qi/2(v,t) = 

where k is the length of the longest common prefix of <7 and r. With this 
definition, we have diam[a] = (1/2) l“L (Recall the notation |a| for the length 
of a finite string a.) 


Exercise 4.2.1. The addressing function h defined above satisfies the Lips- 
chitz condition 

I Ha) - h{r)\ < g 1 / 2 (a,T). 

Exercise 4.2.2. Is the addressing function h inverse Lipschitz? 


General Definition 

Let (ri,r 2 ,-" ,r n ) be a contracting ratio list. The model for this ratio list 
will be the space of infinite strings on an alphabet E with n letters. If 
no better choice suggests itself, the set E = { 1 , 2 , • • • ,n} may be used as the 
alphabet. Usually, however, we will choose letters that suggest the intended 
example. But there will be understood a one-to-one correspondence between 
the letters of E and the ratios in the ratio list. When the alphabet is known, 
we will often even use them to label the ratio list, so we may write (r e ) e gE 
for (ri,r 2 , • • • ,r n ). 

For each letter e £ E, there is a corresponding function 6 e : E^ —> E^\ 
called a right shift, defined by 


9 e (a) = ea. 

That is, insert the letter e at the beginning of the string. We will define a 
metric on E ^ so that the right shifts (9 e )e^E form a realization of the given 
ratio list (r e ) eS £- 

To define a metric on E ^ , we will specify a “diameter” w a for each node 
a € £(*) of the tree. This is done recursively: 
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w A = 1, 

w ae = w a r e for a £ E ^ and e £ E. 

Alternatively, w a is the product, over all the letters e that make up a, of 
the ratios r e . In the Sierpinski gasket example above, for example, wlrlu = 
t’lT'rt’iTu- The metric g is defined from these diameters in the usual way. If 
there are at least two letters, then there are no exceptions to the formula 
diam[a] = w a . 

Now it is easy to verify that the iterated function system(0 e )eGB realizes 
the ratio list ( r e ) ee E ■ Suppose a, r G have longest common prefix a. If 
e £ E, then the longest common prefix of ea and er is ea. So 

g{0 e (a),0 e {T)) = w ea = r e w a = r e q(a,T). 

That is, 9 e is a similarity on (E^ u \ g) with ratio r e . 

The metric space E ^ is complete, so the right-shift realization (d e )eeE 
has a unique nonempty compact invariant set. In fact, that invariant set is 
the whole space E^\ The space E^\ together with the right shifts, will be 
called the string model of the ratio list {r e ) e ^E- 

Theorem 4.2.3 (String model theorem). Let S be a nonempty complete 
metric space and let (f e ) ee E be any iterated function system realizing the 
ratio list (r e ) e ^E in S. Assume that r e < 1 for all e. Then there is a unique 
continuous function h : E^ — > S such that 

M ecr ) = fe{h(a)) 

for all o £ E^ and e £ E. The range h[E^] is the invariant set of the 
iterated function system (/ e ) eg E. 

Proof. We will use uniform convergence. We will define recursively a sequence 
( gk ) of continuous functions gk : E^ — > S. Choose any point a £ S. Define 
<7o ( cr ) = n for all o\ If gk has been defined, then define gk+i by: 

g k +i(eo) = fe{g k (cr)) 

for e £ E and cr £ E^ u \ The function go is clearly continuous. We can verify 
by induction that gu+i is continuous, using the fact that each of the sets [e] 
is open. 

I claim that the sequence (gk) converges uniformly. Let r = max e r e , so 
that r < 1. Now E^' is compact, so g u (gi,go) is finite. We have 

g(gk+i(eo),g k (eo)) = g(f e (g k (o))J e (g k - i(cr))) 

< r e g(g k (o),g k _ 1 (o)) 

< rg u (g k ,g k _i). 
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Therefore g u (g k+1 ,g k ) < rg u (g k ,g k - 1 ). So g u (g k+1 ,g k ) < r k g u (gi,g 0 ) by in- 
duction. By the triangle inequality, if ro > k , then 

m— 1 oo 

Qu(g m ,gk) < ^2 e*(9j+i,9j) < ^2^ g u ( gi ,g 0 ). 

j—k j=k 

This is the tail of a convergent geometric series, so it approaches 0 as k — > 
oo. Therefore the sequence (g k ) is a Cauchy sequence in G(E( U \S). So it 
converges uniformly. Write h for its limit. 

Now we have by the definition of the sequence (gk), 


9k+l [£ M ] = U fe\g k [E (w) ] 


eGE 


The sequence of sets (g^E^]) converges to the invariant set by Corol- 
lary 4.1.4, and converges to ft [£?(“')] by Proposition 2.5.7. Therefore /i [£?(“)] 
is the invariant set. 

For the uniqueness, suppose that g: E^ — > S and h: E^ u > — > S with 
g(ea) = f e (g(o')) and h(ea) = f e (h(a)). Now with r = max e r e as before, we 
have by the same calculation as above, g u (g , h) < rg u (g , h). This is impossible 
unless g u (g, h) = 0. So g = h. □ 

We call h the addressing function. When x and er are related by x = 
h(a), we say that a is an address of the point x. 

Exercise 4.2.4. The addressing function h is Lipschitz when the string model 
has the metric described above. 


Exercise 4.2.5. Let K be the attractor for the iterated function system 
( fe)eeE , an d let h: E ^ — > S be the addressing function. Assume the family 
{ f e [K] : e £ E} has order n (in the sense defined on p. 91). Show that the 
map h is at most (n + l)-to-one. Conclude that ind K < n. 

This can compute the topological dimension for most of the self-similar 
examples we have seen. But not all. 

Exercise 4.2.6. Let K be the Menger sponge (p. 121). Compute its topolog- 
ical dimension. 


4.3 Graph Self-Similarity 

There is a generalization of self-similarity that provides a way to study a larger 
class of sets. A definitive formulation is due to Mauldin and Williams, but 
variants were used by others. Figs. 4.3.1 and 4.3.2 illustrate two examples. 
There will be a list of several nonempty compact sets to be constructed 
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Curve A Curve B Curve C 



A consists of a half-size copy of A and 
a half-size copy of B, both reflected 



B consists of a full-size rotated copy of C 
and a one-third-size rotated copy of A 




C consists of a quarter-size copy of B and a 
three-quarter-size copy of C, both reflected 

Fig. 4.3.1. The MW curves 


simultaneously. Each of them is decomposed into parts obtained from sets in 
the list using certain similarities. 

Li’s lace fractal (p. 84) is made up of isosceles right trianglar blocks of two 
kinds, called P and Q. Each of these sets is made up of parts that are similar 
copies of the same sets P, Q. Sometimes an illustration (as in Fig 4.3.3) can 
be used to specify the information on how the similarities act. The transfor- 
mations map the triangles P, Q to the smaller triangles that make them up. 
A nonsymmetric letter is used in the picture to show how the image should 
be oriented. 

For each such construction, there corresponds in a natural way a directed 
multigraph, with a positive number associated with each edge. There is one 




Fig. 4.3.2. Li’s lace 
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Fig. 4.3.3. Structure of Li’s lace 


node of the graph for each of the sets to be constructed. The edges from a node 
correspond to the subsets into which the corresponding set is decomposed, 
and the number associated with each edge corresponds to the ratio of the 
similarity. (Be careful to pay attention to which way the arrows go. It may 
seem more natural for some purposes to do it the other way around, but this 
is the direction chosen by Mauldin and Williams.) The structure involved, a 
directed multigraph (V, E, i, t) together with a function r : E — » (0, oo), will be 
called a Mauldin- Williams graph. For technical reasons we assume each 
vertex has at least one edge leaving it. 

Suppose ( V, E, i, t, r) is a Mauldin-Williams graph. An iterated function 
system realizing the graph is made up of metric spaces S v , one for each vertex 
v £ V, and similarities f e , one for each edge e £ E, such that f e : S v — > S u 
if e £ E uv , and f e has ratio r(e). An invariant set list for such an iterated 
function system is a list of nonempty compact sets K v C S v , one for each 
node v £ V, such that 

Ku= U fe[K v \ 

vev 

e£E uv 

for all u £ V. (An “invariant set list” may sometimes be called an invariant 
list or an attractor.) When a graph (V,E,i,t,r) and an iterated function 
system (/ e ) are related in this way, we may say that the iterated function sys- 
tem is directed by the graph, and call it a graph- directed iterated function 
system. 

Compare this abstract definition to the two examples. The Mauldin- 
Williams graphs are shown in Figs. 4.3.4(a) and 4.3.4(b). 

Each of the nonempty compact sets K v satisfying such equations will be 
said to have graph self -similarity . 

For this purpose, the ratio lists discussed above correspond to graphs with 
a single vertex, and one loop for each item in the ratio list. 
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Fig. 4.3.4. (a) Graph for the MW curves 



Fig. 4.3.4. (b) Graph for Li’s lace 


Existence of Invariant Set List 

Just as in the case of ratio lists, if proper conditions are satisfied, then the 
invariant list of sets exists and is unique. 

Theorem 4.3.5. Let (V,E,i,t,r) be a Mauldin-Williams graph. Suppose 
r(e) < 1 for all e € E. Let (f e ) e ^E realize the graph in nonempty complete 
metric spaces S v . Then there is a unique list (K v ) v g y of nonempty compact 
sets (I\ v C S v ) such that 


K u = U f e [K v ] 

vev 

e€.E uv 


for all u £ V. 

Proof. The proof is similar to the proof of Theorem 4.1.3. The metric spaces 
X(S V ) are complete. So the finite Cartesian product 

n 

vev 

is also complete with the metric given by the maximum of the coordinate 
metrics. Let us write (A v ) ve y for a typical element of this product space. The 
function defined by 


F({A v ) v€V ) 


U fMv 


vGV 

e£.E uv 


u£V 


is a contraction mapping, and its unique fixed point is the invariant list re- 
quired. □ 
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A Mauldin-Williams graph (V, E, i, t, r ) will be called strictly contract- 
ing if the conditions r(e) < 1 are satisfied. You can note that the graph in 
Fig. 4.3.4 for the MW cirves has an edge with value 1, so it is not strictly 
contracting. 

Exercise 4.3.6. Discuss the graph self-similarity of the boundary of Heigh- 
way’s dragon. 

Path Models 

Let us consider the Mauldin-Williams graphs, and their realizations as it- 
erated function systems of similarities. It will be useful to have a tree-type 
model for such a system. (Computing the Hausdorff dimension of the tree 
model will be much of the work in computing the Hausdorff dimension for 
sets with graph self-similarity.) The models will be analogous to the string 
models considered above. But (because of their construction) we will usually 
call them “path models’.’ 

Let (V, E,i,t,r) be a Mauldin-Williams graph. There is a “path forest” 
corresponding to it, as on p. 80. The definition of the edge-value function 
r can be extended to paths by defining: r(A u ) = 1 for empty paths, and 
r(ae) = r(a)r(e) for any path a and any edge e with t{a) = i(e). 

Given an iterated function system (f e ) e £E, acting on spaces S v , we may 
extend the notation to paths: /u u is the identity function on S u , and f ae is 
the composite function f a o f e , defined by f ae (x ) = f a ( y f e (x)J. In cases when 
confusion will be minimal, we will save writing by using the edge e itself to 
stand for the function / e ; that is, we may write e(x), where x £ <SY e ), as an 
abbreviation of f e (x). Similarly, if a is a finite path, we may write a(x) for 
fa(x). 

Let (V,E,i,t,r) be a strictly contracting Mauldin-Williams graph. The 
spaces E^ of infinite paths of the graph admit right shift maps as before: If 
e £ E uv , then we define 9 e : E ^ —> E^ by 9 e (a) = ea. One way to define 
metrics on the spaces E so that this family of maps realizes the Mauldin- 
Williams graph is as follows: If cr,r £ E^ u \ and a is their longest common 
prefix, then g(a, r) = r(a). [We will, however, use a different system of metrics 
in Sec. 6.6.] 

Exercise 4.3.7. With the metrics g defined, the right shift 9 e is a similarity 
with ratio r(e). 

Rescaling 

Suppose ( V,E,i,t,r ) is a Mauldin-Williams graph. Let maps f e realize it in 
spaces (S v , g v ). To rescale the realization, we replace each of the metrics g v 
by a constant multiple g' v = a v g v of itself. The sets S v are unchanged, and the 
maps f e are unchanged. Of course an invariant list for the original iterated 
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function system is exactly the same thing as an invariant list for the rescaled 
iterated function system. 

Under the new metrics, what happens to the contraction ratios of the maps 
/e? If e G E uv , then 

Qu{fe(x), fe(y)) = a u Q u (f e (x)J e {y)) 

= a u r(e)g v (x, y) 


Thus, with the new metrics, the maps f e realize a Mauldin-Williams graph 
(V, E , i, t , r'), where 

r'(e) = — r(e) for e £ E uv . 
a v 

The Mauldin-Williams graph (U, E, i , t, r') is called a rescaling of the graph 
(U, E , i, f, r). A Mauldin-Williams graph (V, E, *, t , r) will be called contract- 
ing iff it is a rescaling of a strictly contracting graph. 

Exercise 4.3.8. Show that the graph in Fig. 4.3.4 is contracting. 

Theorem 4.3.5 shows that a realization of a contracting graph in a complete 
space has a unique invariant list. That list can be constructed by the method 
analogous to Corollary 4.1.4. 

Finally, we have an alleged criterion for contractivity. 

Exercise 4.3.9. Prove or disprove: A Mauldin-Williams graph (V, E,i,t,r) 
is contracting if and only if r(a) < 1 for all nonempty cycles a £ E^*\ 

Similarity Dimension 

A Mauldin-Williams graph has a “sim-value” associated with it, in the same 
way as a ratio list. The case of a Mauldin-Williams graph with 2 nodes will 
be discussed now. The definitions for graphs with more than 2 nodes are given 
in Sect. 6.6. 

Let (U, E,i,t,r) be a Mauldin-Williams graph. Suppose V = {1,2}. Write 

A( S ) = £ r(ey 

ee-En 

B ( s ) = r ( e ) s 
e£Ei2 

C(s) = J2 Ke) s 

D (s) = r ( e ) s > 

e€E22 
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and let 


0 ^ _ ^(a) + D(s) + \J (A(s) — D(s )) 2 + 4j?(s)C(.s) 

The sim-value of the graph is the solution s of the equation ^(s) = 1. 
(Unfortunately, we will not see the reason for such a complicated definition 
until later.) 

As in the case of a ratio list, such a number s need not exist. But with 
reasonable additional conditions, the sim-value exists and is unique. Recall 
(p. 80) that a directed graph is strongly connected if there is a path from 
any node to any other node. 


Proposition 4.3.10. A strictly contracting, strongly connected Mauldin-Wil- 
liams graph with 2 nodes has a unique sim-value. 


Proof. Since the graph is strictly contracting, we have A(s) — > 0, B(s) — > 0, 
C(s) — > 0, and D(s) — > 0 as s — > 00 , so d>(s) — > 0. 

Since the graph is strongly connected, there is at least one edge from 1 to 2 
and at least one edge from 2 to 1. So B(0) > 1 and C(0) > 1. Then <P(0) > 1. 
Equality ^(0) = 1 holds only if A(0) = D( 0) = 0 and B( 0) = C(0) = 1; that 
is, the two edges postulated are the only edges. In that case the sim-value 
is 0. In all other cases, <P(0) > 1. Thus: if the graph is strictly contracting 
and strongly connected, then there is a nonnegative solution s to the equation 
<P(s) = 1 . 

I claim that the solution is unique since ^'( s ) < 0 for all s. The partial 
derivatives of <P are 


d$ _ 1 { 1 A — D 
dA ~ 2 + 2 y/(A- D ) 2 +ABC’ 

d$_ C 

dB ~ ^y(A — D ) 2 + ABC ’ 
d$ _ B 

dC ~ y/(A- DY + ABC 1 
d$ _ 1 | 1 D- A 
dD ~ 2 + 2 ^/(A — D ) 2 + ABC 


Recall that A(s) > 0, B(s) > 0, C(s) > 0, and D(s ) > 0. Then \A — D\ < 
\J{A — D ) 2 + ABC, so 


< 9 < 2 > 1 

dA~ 2 



54 > 

as 


> 0 , 


d<P 

dC 


> 0 , 


a<i> 1 
dD - 2 


1 

2 


0. 


The four derivatives satisfy A'(s) < 0, B'(s) < 0, C'(s ) < 0, and D’(s ) < 0. 
So we have <2>'(s) <0. □ 
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Suppose we have an invariant set list ( K v ) directed by a contracting 
Mauldin-Williams graph. We say that the sim- value of the graph is the simi- 
larity dimension of the sets K v . As before, this terminology may be inexact. 
But under the right conditions (discussed in Chap. 6) this sim-value equals 
the Hausdorff dimension of all the sets K v . 

Exercise 4.3.11. Compute the similarity dimension of the Li lace fractal. 
That is: Compute the sim-value of the graph in Fig. 4.3.4(b). 

We will do examples later involving the Mauldin-Williams graphs in 
Figs. 4.3.12 and 4.3.13. 



Exercise 4.3.14. Compute the sim-value of the graph of Fig. 4.3.12. 

Exercise 4.3.15. Compute the sim-value of the graph of Fig. 4.3.13. 

Here is the way in which the sim-value of a graph will be used. 

Proposition 4.3.16. Let (V, E, i , t , r) be a Mauldin-Williams graph with V = 
{1,2}. The number s > 0 is the sim-value of the graph if and only if there 
exist positive numbers x and y satisfying 

x = A{s)x + B(s)y 
y = C(s)x + D(s)y. 

Proof. Write A = A(s), etc. They satisfy A > 0, B > 0, C > 0, and D > 0. I 
must show that ( A + D + ^{A — D ) 2 + ABC )/2 = 1 if and only if there exist 
positive x and y with Ax + By = x, Cx + Dy = y. 

First, suppose (A + D + \/(A — D ) 2 + 4BC)/2 = 1. Then A + D < 2, 
so either A < 1 or D < 1. We will take the case A < 1; the other case is 
similar. Let x = B and y = 1 — A. Then x > 0 and y > 0. The first equation 
is Ax + By = AB + B(1 — A) = B = x. Algebra applied to the equation 
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(A + D+ y/(A- D) 2 +ABC)/2 = 1 yields BC - AD = 1 - A - D. For the 
second equation, Cx + Dy = CB + D{ 1 — A) = BC — AD + D = 1 — A = y. 

Conversely, suppose positive numbers x and y exist with Ax + By = x and 
Cx + Dy = y. Since y ^ 0, we may solve: 

B _ x _ 1 - D 

i c ' 

This means that (1 — A)(l — D) = BC, so with a little algebra, (A — D) 2 + 
4 BC = (2— A— D) 2 . Now B/(l—A) = x/y > 0, and B > 0, so A < 1. Similarly 
D < 1. So 2 — A — D > 0, and therefore y/(A — D ) 2 + 4 BC = 2 — A — D, or 
(A + L> + y/(A-D) 2 +4BC)/2 = 1. □ 

Exercise 4.3.17. How is the sim-value of a two-node Mauldin- Williams 
graph affected by rescaling? 

Exercise 4.3.18. State and prove a “path model theorem” analogous to The- 
orem 4.2.3, for graph self-similarity. 


4.4 ^Remarks 

The idea of self-similarity is explored by P. A. P. Moran [51] and by John 
Hutchinson [36]. It will be useful in Chap. 6 in computation of Hausdorff 
dimensions. Theorem 4.3.5 on the existence of the invariant set is taken 
from [36]. 

The “sim-value” for an iterated function system or for an M-W graph was 
called the “dimension” in the first edition. But that can be confusing. So we 
have used a different term in the second edition. It is not actually a dimension 
of the graph itself, or of the iterated function system, but of a set assiciated 
with them. 

Theorem 4.1.1 shows that the sim-value equation Y^ii=i r t = 1 admits a 
unique nonnegative real solution s. But if we allow s to be a complex number, 
there are other solutions. These so-called “complex dimensions” and their 
analogs have been explored primarily by Michel Lapidus. See [42]. 

Graph self-similarity is a generalization of self-similarity. There are several 
sources for similar ideas; the one used as the model here is [48]. The example 
shown in Figs. 4.3.1 was concocted to illustrate the idea. Li’s lace is from [43], 
Example (c), Sect. 3.3. 

The Menger sponge is a universal 1-dimensional space. Karl Menger de- 
fined this space (and corresponding spaces in higher dimensions) for this pur- 
pose. The exact statement is like Theorem 3.4.4: Let S' be a separable metric 
space. Then CovS < 1 if and only if S is homeomorphic to a subset of the 
Menger sponge. [8, p. 503ff]. 

Exercise 4.1.2: Approximate values of the similarity dimension: Heighway, 
2; Koch, 1.26; McWorter, 1.86; twindragon, 2; Eisenstein, 2; 120-degree, 1.26. 
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Exercise 4.1.8: log kj log |6|. 

Exercise 4.1.10: Topological dimension 1; similarity dimension approxi- 
mately 2.73. 

Exercise 4.3.14: 1. 

Exercise 4.3.15: The dimension is — log x/ log 3, where x is a solution of 
x 3 — x 1 — 2x + 1 = 0. So the dimension is approximately 0.737. 


He had finally made some progress on the Barnsleyformer; because the 
trick wasn't in the morphogenesis after all, but in the fractal geometry. 
— Michael F. Flynn, “Remembered Kisses',' ANALOG, December, 1988 
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Measure Theory 


This chapter contains the background from measure theory that is required to 
understand the Hausdorff dimension. It is true that the Hausdorff dimension 
can be defined in half a page without reference to measure theory, but when 
it is done that way there is no indication of the motivation for the definition. 

Measure theory will also be indispensable in many of the proofs related 
to fractal dimension. It will simplify many of the proofs of lower bounds for 
Hausdorff dimension and of upper bounds for packing dimension. Instead of 
repeating a combinatorial calculation in each instance, we do the combina- 
torics once and for all in this chapter, and then repeatedly reap the benefits 
in Chap. 6 . 

Since measure theory (like metric topology) is a standard part of graduate 
mathematics curriculum today, most of the introductory remarks to Chap. 2 
are also applicable here. 


5.1 Lebesgue Measure 

Certain calculations will be done with the symbols oo and — oo. They are not 
real numbers, but they can be useful in connection with calculations involving 
real numbers. Most of the conventions are sensible when you think about them. 
Here are some examples: 

( 1 ) If a € M, then — oo < a < o o. 

( 2 ) If a € R, then a + oo = oo and a — oo = — oo . Also 00 + 00 = 00 and 

—00 — 00 = —00. The combination 00 — 00 is not defined. 

( 3 ) If a £ R. is positive, then a x 00 = 00 and a x (—00) = —00. If a € M 

is negative, then ax 00 = —00 and ax (—00) = 00. The combination 
0 x 00 is not defined. [However, we do understand that an infinite series 

a n, where every term a n = 0 , has sum 0 .] 
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The length of one of the intervals 

(a,b) ( a,b ] [ a,b ) [a,b] 

is b—a, where a, b £ R and a < b. The length of the degenerate interval [ a, a ] = 
{a} is 0; the length of the empty set 0 is 0. The length of an unbounded 
interval 

(a, oo ) [a, oo) (— oo ,b) (— oo,b] (— 00 , 00 ) 

is 00 . This follows the conventions on calculation with 00 . 

We will be interested in a substantial generalization of the notion of the 
“length” of a subset of R. The lemma that makes it possible asserts that the 
length of a countable union of intervals cannot exceed the sum of the lengths 
of the parts. 

Lemma 5.1.1. Suppose the closed interval [c, d) is covered by a countable fam- 
ily of open intervals: 

[ c,d ] C (J(ai,6i). 

Then 

OO 

d— c< ^^{bi - a,i). 

2=1 

Proof. First, since [c, d] is a compact set, it is in fact covered by a finite number 
of the intervals: 

n 

[■ c,d } C (J(aj,6i) 

2=1 

for some n. I will show that when this happens, the conclusion 

n 

d- c < ^2(bi - Oj) 

follows. The proof is by induction on n. 

If n = 1, then [c, d] C (ai, £> 1 ), so ai < c and d <b\. Thus d — c < bi — a\, 
as required. 

Now suppose n > 2, and the result is true for covers by at most n — 1 open 
intervals. Suppose 

n 

[ c,d\ C (J (ai,bi). 

j= 1 

If some interval (a*, bi) is disjoint from [c, d], it may be omitted from the cover; 
then we have a cover by at most n — 1 sets, so we would be finished by the 
induction hypothesis. So assume (aj,6j) D [c, d] ^ 0 for all i. Among all of 
the left endpoints Oj, there is one that is no larger than any of the others. By 
renumbering the intervals, let us assume that it is ai. Since c is covered, we 
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must have a\ < c. Now if b i > d , we have d — c < bi — ai < Y^i=i(bi — ai), 
so we are finished. So suppose b\ < d. Since (ai,&i) intersects [c, d], we have 
bi > c. So b\ € [c, d]. At least one of the open intervals (oj, bi) covers the point 
&i. By renumbering, we may assume it is ( 02 , 62 )- Finally, we have a cover of 
[c, d] by n — 1 sets: 

n 

[c,d] C (ai,& 2 ) U (J(aj, 6 i). 

i—3 

So by the induction hypothesis, 

n 

d- c < (6 2 - ai) + 

i = 3 

n 

< (b 2 - a 2 ) + (61 - ai) + - ai) 

i = 3 


as required. This completes the proof by induction. □ 

A useful generalization of the notion of the length of a subset of M is the 
Lebesgue measure of the set. This will be defined in stages. We will use 
half-open intervals of the form [a, b) in the definition. Intervals of other forms 
could be used instead, but these have been chosen because of this convenient 
property: 

Lemma 5.1.2. Let a < b be real numbers, and e > 0. Then [a, b) can be 
written as a finite disjoint union 

n 

[a, b) = U[ai, 6 i), 

i - 1 

with b — a = E i=1 (bi ~ a i) an d bi — ai < e for all i. 

Proof. Choose n £ N so large that {b — a)/n < e. Let bi = a + i{b — a)/ n for 
0 < i < n, and a* = bi- 1. □ 

Now let A be any subset of R. The Lebesgue outer measure of A is 
obtained by covering A with countably many half-open intervals of total length 
as small as possible. In symbols,* 

OO 

£(A) = inf J2(bj ~ aj) 

3 = 1 

where the infimum is over all countable families { [a j , b 3 ) : j £ N} of half-open 
intervals with A C UjGN[ a i’ bj)- 

* In case you can’t tell, the symbol C is supposed to be a fancy letter L, for 
“Lebesgue’.’ 
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Lemma 5.1.3. Let ACK and let e > 0. Then 

OO 

C(A) = inf ~ aj) 

3=1 

where the infimum is over all countable families { [aj, bj) : jgf}} of half- open 
intervals with A C bj) an d bj — aj < e for all j . 

Proof. This follows from Lemma 5.1.2. □ 

We must do some combinatorics on the line to see that the definition is 
not trivial.* 

Theorem 5.1.4. If A is an interval , then C(A) is the length of A. 

Proof. Suppose A = [a, b], where a < b are real numbers. First, if e > 0, then 
the singleton {[a, b + e)} covers the set A, so C(A) < b — a + e. This is true 
for any e > 0, so C{A) < b — a. 

Now suppose A C \J jGN [aj,bj). Let e > 0, and write a' = a 3 — e/2L 
Then A C |J jeN (a' , bj). By Lemma 5.1.1, Y^jLifij ~ a 'j ) > b — a. So we have 
Y2JLi(bj — a 3 ) > — a'j) — e > b — a — e. This is true for any e > 0, so 

Y^jL\(bj — aj) >b — a. Therefore C(A) > b — a. So we have £([a, b ]) = b — a. 

Next consider A = (a, b). Then C(A) < £([a, 6]) = b — a and on the other 
hand C(A) > C{[a + e, b — e]) = b — a — 2e for any e > 0. Similar arguments 
cover cases [a, b) and (a, b\. If A = [a, oo), then AD [a, a + t\ for any t > 0, 
and therefore C(A) > t; this means that C(A) = oo. Similar arguments cover 
the other cases of infinite length intervals. □ 

Here are some of the basic properties of Lebesgue outer measure. 

Theorem 5.1.5. (1) £(0) = 0; 

(2 ) if AC B, then Z(A) <_Z(B); 

(3) £(U„ gN A„)<£~i£(A0. 

Proof. For (1), note that 0 C (J igN [0, e/2 4 ), so £(0) < e. For (2), note that 
any cover of B is also a cover of A. 

Now consider (3). If C(A n ) = oo for some n, then the inequality is clear. 
So suppose C(A n ) < oo for all n. Let e > 0. For each n, choose a countable 
cover D n of A n by half-open intervals with 

^ £(£>) < £(A n ) + 2~ n e. 

DeD n 

Now D = UneN i s a countable cover of the union Therefore 

* I can easily write down the same definition for subsets of the rational numbers. 
But then every set turns out to have outer measure 0, so it is not a very useful 
definition. 
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c ( U ^ E £ ( D ) EE £ ( fl ) 

VnGN / DeD n=lDgB„ 

oo oo oo 

< T(A ra ) + 2 n e = T(A n ) + e. 

n=l n=l n— 1 

Since e was any positive number, we have 

£( |J A„j < E £ (^)' D 

In general, the inequality in part (3) is not equality, even for two disjoint 
sets. But we do have equality in some cases. The simplest case is the following: 

Theorem 5.1.6. Let with dist(A, B ) > 0. Then Z(AUB) = Z(A) + 

C{B). 

Proof. First, the inequality Z{A U B) < Z{A) + Z(B) follows from part (3) 
of Theorem 5.1.5. Let e = dist(A, B)/ 2, and let A U B C UjgnI 0 !’ bj), where 
bj — a.j < e for all j. Then each interval [a-j , bj ) intersects at most one of the 
sets A and B. So the collection “D = {[a j,bj) : j £ N } can be written as the 
disjoint union of two collections, D = T>\ U T> 2 , where T>i covers A and T >2 
covers B. Now Z(A) < X] DgI)l Z(D) and Z(B) < J2dg'D 2 £ (^)’ so 

OO 

z{a)+z{b)< y ad)+ Y £ ( D )= 

DeDi D 6D 2 DeD j=l 

Therefore, by Lemma 5.1.3, we have Z(A) + Z(B) < Z{A U B). □ 

Corollary 5.1.7. If A,B CB are disjoint and compact, then we have Z{A) + 
Z(B) = Z(AuB). 

Proof. Apply Theorems 2.3.19 and 5.1.6. □ 

Theorem 5.1.8. If ACM., then 

Z(A) = inf { Z(U) : U O A, U open } . 

Proof. Certainly Z(A) < inf { Z(U) : U D A, U open }. So I must prove the 
opposite inequality. If Z{A) = oo, it is trivially true. So suppose Z{A) < oo. 
Let e > 0. Then there exists a cover (J JgN [aj,bj) of A with iOj - a,j) < 
Z(A) + e/2. Now the set U = U yg N( a i ~ e/2 J ' +1 ,6j) is open, U O A, and 
Z(U) < — aj) + e/2 < Z(A) + e. Therefore Z(A) + e > Z(U). This 

shows that Z{A) > inf { Z{U) : 1/34}. □ 
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The outer measure C{A) of a set A C ]R is determined by approximat- 
ing a set from the outside by open sets. There is a corresponding “inner 
measure’,’ obtained by approximating a set from the inside. This time, how- 
ever, we will use compact sets. 

Let A C R. The Lebesgue inner measure of the set A is 

C(A) = sup { C(K) : K C A, K compact } . 

Again, we need an argument to see that the definition is interesting. 

Theorem 5.1.9. If A is an interval, then C{A) is the length of A. 

Proof. We consider the case of an open interval A = (a, b) . Other kinds of 
intervals follow from this case as before. 

If I\ C A is compact, then K is covered by the single interval A, so that 
C(K ) <6 — a. Therefore C(A) < b — a. On the other hand, if e > 0, then the 
set [a + e, 6 — e] is compact, so CfA) > £([a + e, 6 — e]) = 6 — a — 2e. This is 
true for any e > 0, so CfA) > b — a. □ 

Exercise 5.1.10. If A C M is any set, then CfA) < C{A). 

It is not possible to prove that CfA) — C{A) in general. A set A is called 
Lebesgue measurable , roughly speaking, when this equation is true. Pre- 
cisely: If C(A) < oo, then A is Lebesgue measurable iff CfA) = C{A). If 
C(A) = oo, then A is Lebesgue measurable iff A n [— n, n] is Lebesgue mea- 
surable for all n £ N. If A is Lebesgue measurable, we will write C{A) for 
the common value of C{A) and CfA), and call it simply the Lebesgue mea- 
sure of A. We will often say simply measurable when we mean Lebesgue 
measurable. 

Theorem 5.1.11. Let A\,A 2 ,- ■ ■ be disjoint Lebesgue measurable sets. Then 
U n A n is measurable, and C{[} n A n ) = ^ n £(A n ). 

Proof. It is enough to prove the theorem in the case that £((J A n ) < oo, since 
the general case will then follow by applying this case to sets A n D [— m, m] . 
We know by Theorem 5.1.5 that £((JA n ) < YfC{A n ). Let e > 0. For each 
n, choose a compact set K n C A n with C{K n ) > C(A n ) — e/2 n . Since A n 
is measurable, C(K n ) > C(A n ) — e/2”. Now the sets K n are disjoint, so by 
Corollary 5.1.7, the compact set L m = Ki U K 2 U • • • U K m satisfies C(L m ) = 
C{Kf) + • • • + C(K m ). Therefore C(\]A n ) > l C(K n ). Now this is true 
for all to, so £(|J A n ) > C(A n ) - e. This is true for any 

positive e, so we have C(fJ n A n ) > YfC(A n ). 

So £(U A n ) = £(|J A n ), and therefore (J A n is measurable and £((J A n ) = 

□ 

Theorem 5.1.12. Compact subsets, closed subsets, and open subsets ofM. are 
Lebesgue measurable. 
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Proof. Let K C R be compact. It is bounded, so K C [—n, n] for some n, and 
therefore £(K) < oo. The compact set K is a subset of K, so £{K) > C(K). 

Let F C R be a closed set. Then for each n £ N, the intersection FC I [— n, n] 
is compact, and therefore measurable. Thus F is measurable. 

Let U be an open set. It is enough to do the case £{U) < oo. For each 
x £ U , there is an open interval I with x £ I C U . By the Lindelof property, 
U is the union of countably many of these intervals, say U = Now 

each set I n \ U/=i h is a finite union of intervals (open, closed, half-open) so 
that U is a disjoint union of countably many intervals. So U is measurable. □ 

Theorem 5.1.13. Let ICR. Then A is measurable if and only if, for every 
e > 0, there exist an open set U and a closed set F with U A A D F and 
£(U\F) < e. 

Proof. Suppose first that A is measurable. We consider first of all the case 
£{A) < oo. Then there exists an open set U A A such that £(U) < £(A)+e/ 2. 
There exists a compact (therefore closed) set F C A with £(F) > £{A) — e/2. 
Now U \ F is open, hence measurable, and F is compact, hence measurable, 
so C(U) = £{U \ F) + £(F). Since the terms are all finite, we may subtract, 
and we get 

£(U \F) = £(U) - £(F) < £{A) + e/2 - £(A) + e/2 = e. 

Now we take the case C{A) = oo. All of the sets Ar\[—n, n] are measurable, 
so there exist open sets U n A n [— n, n] and compact sets F n C A fl [— n, n ] 
with C(U n \ F n ) < e/2 n+2 . Define U' n = U n fl ((— oo, — n + 1 + e/2 n+2 ) U 
(n — 1 — e/2 n+2 ,oo)) and F " n = F n fl ([— n, — n + 1] U [n — l,n]), so that 
U' n is open, F' n is compact, U' n A A fl ([— n, — n + 1] U [n — l,n]) D F " n and 
C(U' n \ F' n ) < 3e/2"+ 2 < e/2 n . Now U = {jU' n is open, and F = \jF' n is 
closed (Exercise 2.2.27). We have U A A D F, and U \ F C U ra eN(^ r n \ ^n)’ 
so that C(U \F)<J2 C{U' n \ F' n ) < e. 

Conversely, suppose sets U and F exist. By Theorem 5.1.12 they are 
measurable. First assume M(A) < oo. Then £(F) < oo, and C(U) < 
C{U \ F) + £(F) < e + C{F) < oo. Now M(A) < M(C)_ = C(U) <L(F) + £ = 
£(F) + £ < C(A) + £. This is true for any e > 0, so M(A) = £(A), so A is 
measurable. 

For the case £(A) = oo, we have U fl (— n — £,n + e) D An [— n,n] A 
F fl [— n,n], and the previous case may be applied to these sets, using 3e in 
place of e. □ 

Here are the basic algebraic properties of Lebesgue measurable sets. 

Theorem 5.1.14. (1) Both 0 and R are Lebesgue measurable. 

(2) If A CM. is Lebesgue measurable, then so is its complement R \ A. 

(3) If A and B are measurable, then so are An B, AU B, and A\B . 

(4) If A n is measurable for n £ N, then so are U n eN^ n am ^ 
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Proof. For (1), note that £(0) = 0 and R fl [— n, n] is measurable for all n. 

For (2), note that if F C A C U, then R \ Z7 C R \ A C R \ F and 
(R \ F) \ (R \ U) = U \ F. 

For the intersection in (3), note that if Fi C A C U\ and F2 C B C U2, 
then F 1 nF 2 C AnB C U 1 nU 2 and (C/inl7 2 )\(F 1 nF 2 ) C (E7i\F 1 )U(l7 2 \F 2 ). 
This is enough to show that A D B is measurable. Now AU B = R \ ((R \ 
A) n (R \ F)) , so A U B is measurable. And A \ B = A n (R \ F), so A \ B is 
measurable. 

Finally, for (4), note that by (3) we may find disjoint measurable sets 
B n with the same union as A n , so that Theorem 5.1.11 is applicable. The 
intersection follows by taking complements. □ 

Note that (4) involves only countable unions and intersections. 

Proposition 5.1.15. The Lebesgue measure of the triadic Cantor dust is 0. 

Proof. The Cantor dust C is constructed on p. 2. The set C n AC consists of 
2" disjoint intervals of length 3 -n . Therefore £(C) < 2 n ■ 3~ n . This has limit 
0, so £(C) =0. □ 


Caratheodory Measurability 

Caratheodory provided an alternate definition of measurability. Its disadvan- 
tage is that the motivation is not as clear. Its advantage is (as we will see in 
Sect. 5.2) that it can be used in other situations. 

A set A C R is Caratheodory measurable iff 

£(E) = ~C(E C A) + ~C(E \ A) 


for all sets E C R. 

Proposition 5.1.16. A set A C R is Caratheodory measurable if and only if 
it is Lebesgue measurable. 

Proof. Suppose A is Lebesgue measurable. Let E be a test set. The inequality 
£(E) < £(F fl A) + £(F \ A) is always true. Let e > 0. There exist an open 
set U and a closed set F with F C A C U and C{U \F) < e. Let PDF be 
an open set. Then 

£(F \ A) + £(F n A) < C(V \ F) + £(P n U) 

< c(v \u) + £({/ \ f) + £(v n u) 

< £(P) + £. 

Now take the infimum over all such V, to get £(Ffl A) +£(F\ A) < £(F) + e. 
Therefore £(F ft A) + £(F \ A) < £(F). This proves that A is Caratheodory 
measurable. 
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Conversely, suppose A is Caratheodory measurable. Consider the case in 
which £(A) < oo. Let e > 0. Let U D A satisfy C{U) < C{A) + e. Now we 
have 

Z{U) = C(U D A) + C(U \ A ), 

so that C(U \ A) < e. So there is an open set V A U \ A with C{V) < e. 
Then U \ V is Lebesgue measurable, and £(U \ V) > C(U) — e, so there is 
a closed set F C U \ V C A with £(F) > C(U) — e. Thus F C A C U and 
C(U \ F) < e. Therefore A is Lebesgue measurable. □ 

Theorem 5.1.17. Let A C R be Lebesgue measurable, and let a similarity 
f: R. — > K with ratio r be given. Then f[A] is Lebesgue measurable and 
C{f[A]) = rC{A). 

Proof. [Strictly speaking, “similarity” disallows r = 0, but even if r = 0 is 
allowed, this formula still works: If r = 0, then the range of / is a single point, 
so of course f[A] is measurable and C(f[A]) = 0.] Now suppose r > 0. 

Consider an interval / = [a, b). The image is an interval, either [/(a), f(b)) 
or (/(&), /(a)], with length |/(6) — /(a)| = r\b — a|. Therefore £(/[/]) = 
r\b - a\. Now if A C U j&iWj’bj), then f[A] C [j f[[aj,bj)], so c(f[A]) < 
^2 Z(f\\aj,bj)]) = rJ2(bj — dj). Therefore we have £(f[A]) < rC{A). If we 
apply the same thing to the inverse map / -1 , which is a similarity with ratio 
1/r, we get £(f[A]) > rC(A). Therefore C(f[A]) = rC{A) 

Now / is a homeomorphism, so the image of an open set is open and the 
image of a closed set is closed. If A C R is measurable, then, for every e > 0, 
there exist a closed set F and an open set U with F C A C U and C(U\F) < e. 
So we have f[F] C f[A] C f[U] and C(f[U] \ f[F ]) = C(f[U \ F]) < re. So 
f[A] is also measurable. □ 

Next is a preview of how measure theory is related to fractal dimension. In 
general, we do not yet know that the similarity dimension of a set is unique. 
However, we can now establish that in one situation we can determine the 
similarity dimension. 

Exercise 5.1.18. Let (ri,f 2 , • • • ,r n ) be a contracting ratio list; let s be its 
sim- value; let (/i, fi, ■ ■ ■ , f n ) be a corresponding iterated function system in 
K; and let A C M be a nonempty measurable set. Suppose £(fj[A]nfk[A]) = 0 
for j k, and 

n 

A=\Jf j [A). 
o = i 

If 0 < C(A) < oo, then s = 1. 

Number Systems 

Recall the situation from Sect. 1.6. Let b be a real number, and let D be a 
finite set of real numbers, including 0. We are interested in representing real 
numbers in the number system they define. 
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Write W for the set of “whole numbers” ; that is, numbers of the form 

M 

3=0 

Write F for the set of “fractions” ; that is numbers of the form 

H a i^- 

j =— CO 

We know (Proposition 3.2.21) that there is no number system that has a 
unique representation for every real number. So we will try to represent all real 
numbers, and arrange to have as few numbers as possible with more than one 
representation. One way to specify that the set with multiple representations 
is small is to require that it have Lebesgue measure 0. 

If we analyze the size of the intersections (F + wi)C\(F +w 2 ), Wi,w 2 G W, 
w\ ^ w 2 , then we will know about all numbers with multiple representations: 

Exercise 5.1.19. The set of all numbers with multiple representations is a 
countable union of sets, each of which is similar to one of the sets (F + wi) fl 
(F + w 2 ), w\,w 2 € W, w\ ^ w 2 . 

The set of all numbers that can be represented is F + W, a countable 
union of sets isometric to F. So if all real numbers can be represented, then 
C(F) > 0. We know that F is a compact set, so also C(F) < oo. If the set of 
all real numbers with multiple representations has Lebesgue measure 0, then 
the sets (F + w{) fl (F + w 2 ) have Lebesgue measure 0. 

Suppose D has k elements. If F has positive Lebesgue measure, but the 
intersections (F + wi) fl (F + w 2 ) have Lebesgue measure zero, then by Ex- 
ercise 5.1.18, F has similarity dimension 1. But the similarity dimension is 
actually log fc / log 1 6 1 . Therefore \b\ = k. 


5.2 Method I 

We will need to discuss measures other than Lebesgue measure. The basics 
are contained in this section. 

Measures and Outer Measures 

A collection T of subsets of a set X is called an algebra on X iff: 

( 1 ) 0,XeJ; 

(2) if A e T, then X \ A € J; 

(3) if A, Be J, then A U B € T. 
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Note that A D B = X \ ((A' \ A) U ( A \ B)) and A\I3 = An(A\ B ), so an 
algebra is also closed under these two operations. 

A collection IF of subsets of a set A is called a a-algebra on X iff: 

(1) 0,leJ; 

(2) if A £ 3) then A \ A £ 3; 

(3) if A 1 , A 2 , ■ ■ ■ £ 3, then U ieN A t £ 37 

Of course (by Theorem 5.1.14), the collection of all Lebesgue measurable 
subsets of K is a cr-algebra on 1R. Combining the clauses of the definition will 
produce a few more rules: For example, if A±, A 2 , ■■•€?, then P| JgN £ 3; 
if A, B £ 3 then A D B, A U B, A \ B £ 3. 

Theorem 5.2.1. Let X be a set, and let T> be any set of subsets of X. Then 
there is a set 3 of subsets of X such that 

(1) 3 is a a-algebra on X; 

(2) 3 D B; 

(3) if 3 is any a-algebra on X with 3 A T>, then 3 2? 3. 

Proof. First I claim that the intersection of any collection of cr-algebras on X 
is a cr-algebra. Let T be a collection of cr-algebras, and let B = f) 4er A be the 
intersection. Then 0 £ A for all A £ T, so 0 £ IB. Similarly X £ CB. If A £ B, 
then A £ A for all A £ r, so X\A £ A for all A £ T, and therefore X\ A £ “B. 
If Ai, A 2 , ■ ■ ■ £ B, then each A n £ A for all A £ P, so (J neN A n £ A for all 
A £ r and therefore (JneN^« € 

So suppose a set D of subsets of X is given. Let r be the collection of 
all cr-algebras 3 on X with 3 A T>. (There is at least one such cr-algebra, 
namely the family of all subsets of A.) Then the intersection T = Dggr 3 is 
a cr-algebra on A. But clearly if 3 is any cr-algebra on A with 3 2®, then 
3 £ r, and therefore 3 2? 32 □ 

We say that 3 is the least a-algebra containing D, or the cr-algebra 
generated by V. Let S' be a metric space. A subset of S is called a Borel 
set iff it belongs to the cr-algebra on S generated by the open sets. 

Let A be a set, and let IB be a cr-algebra of subsets of A. A measure on 
T is a set function* M: 3 r — > [0,oo] such that: 


(1) M(0) = 0; 

(2) If A n £ 3 r is a disjoint sequence of sets, then 



We call (2) countable additivity. 

Let A be a set. An outer measure on A is a function JVC defined on all 
subsets of A, with values in the nonnegative extended real numbers [0, oo], 
satisfying: 

* A set function is a function whose domain is a family of sets. 


148 5 Measure Theory 


(1) M(0) = 0; 

(2) if A C B, then M(A) < _M(R); 

(3) JVt(UraeN^") — En=l 

We call (3) countable subadditivity. 


Defining Outer Measures 

The Lebesgue outer measure was constructed in Sect. 5.1. The way in which 
the definition was formulated was not accidental. We will explore a general 
method for constructing outer measures, known as “method I” We begin with 
candidate values for the measures of some sets (like the lengths of the half- 
open intervals), and then attempt to produce an outer measure that is as large 
as possible, but no larger than the candidate values. 

Let X be a set, and let A be a family of subsets of X that covers X. 
Let c: A — > [0,oo] be any function. The theorem on construction of outer 
measures is as follows: 

Theorem 5.2.2 (Method I Theorem). There is a unique outer measure 
M on X such that 

(1) M(T) < c(A) for all A G A; 

(2) if X is anu outer measure on X with NM) < cM) for all A G A, then 
X(B) < M (B) for all BCX. 

Proof. The uniqueness is easy: if two outer measures satisfy (1) and (2), then 
each is < the other, so they are equal. 

For any subset B of X, define 

M (B) = inf E m 

Aev 

where the infimum is over all countable covers T> of B by sets of A. (Recall 
that inf 0 = oo, so if there is no countable cover at all of B by sets of A, then 
M(B) = oo.) _ ___ 

f claim first that M is an outer measure. First, M(0) = 0, since the empty 
set is covered by the empty cover, and the empty sum has value 0. If B C C, 
then any cover of C is also a cover of B , so M (B) < M (C). Let Bi,B 2 , • • • be 
given. I must prove 

M ( (J B n J < jrM(B n ). 

VraeN / n= 1 

If M (B n ) = oo for some n, then the inequality is clear. So suppose M(f3 n ) < oo 
for all n. Let e > 0. For each n, choose a countable cover T> n of B n by sets of 
A with 

c(A) < M (B n ) + 2~ n e. 

Aev„ 


5.2 Method I 


149 


Now D = U raS N a countable cover of the union (J ngN B n- Therefore 

m MJ B « <E c ( A ) 

VraGN / AeD 
oo 

< E E <=w 

n=l AeDn 

OO oo 

< £ M(R„) + £ 2" n £ 

n = 1 


n— 1 
oo 

= £^ M(S ra ) + £. 


n — 1 


Since e was any positive number, we have 

mMJbJ < jrmBn). 


VnG N 


n— 1 


This completes the proof that DVC is an outer measure. 

Now we may check the two assertions of the theorem. For (1), note that 
for A a A, the singleton {M} is a cover of A, so 

M(A)< £ c(B) = c(A). 

be{A} 

For (2), suppose that N is any outer measure on X with N(^4) < c(^4) for all 
A € A. Then for any countable cover D of a set B by elements of A we have 

£ c(A) > £ N(A) > Jf ( (J A \ > X(B). 

AeD AeD Vagd / 

Therefore 'M(B) > ’N(B). □ 

When we say that an outer measure is to be constructed by method I. 
we are referring to this theorem. In practical terms, this means that the outer 
measure is defined by the formula (I). 


Reduced Cover Classes 

When a measure is defined by method I, it may be helpful to know that the 
covers T> in (I) can be chosen from a smaller (“reduced”) class of sets. 

Proposition 5.2.3. Let X be a set, and let c be a set function. For a collec- 
tion of sets A, let be the method I outer measure defined using the class 
A of sets and the restriction of c to A. 
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(a) IfUCA, then M.a < . 

(b) Suppose that , for every A £ A and every e > 0, there is B £ 23 with B A A 
and c{B) < c(H) + e. Then Ms < M^.. 

(c) Let C > 0 be a constant, and suppose that, for every A £ A, there is B £ 23 
with BAA and c (B) < Cc(A). Then Ms < CJvIji. 

Proof, (a) The outer measure Ms is the largest outer measure such that 
Ms(£) < c (E) for all E £ 23. But M^ also has this property, so M^ < Ms- 

(b) Let e > 0 be given. Let T> = {A\, A 2 , ■ ■ ■ } C A be a countable cover of 
a set E. For each Aj, choose Bj £ 23 with Bj A Aj and c(Bj) < c(Aj) + e/2 J . 
Then D' = {Bi, B 2 , • ■ ■ } is also a cover of E , and 

* + £c(a,)>E c(Bj) > Ms (E). 

3 3 

Take the infimum over all countable D C A that cover E to get 

£ + Myi(-E') > Mj (£). 

This is true for all e > 0, so Myi(-E) > Ms(£?). 

(c) Let T> = {Ai, A2, • • • } C A be a countable cover of a set E. For each Aj, 
choose Bj £ 23 with Bj D Aj and c(Bj) < C c (Aj). Then D' = {B\, B2, ■ • ■ } 
is also a cover of E, and 

C7^c(^)>^c(^)>Ms(-B). 

3 3 

Take the infimum over all countable D C A that cover E to get 

CMy,(£)>Ss(£)- □ 

When condition (b) holds, we will say that 23 is a reduced cover class 
for M. When condition (c) holds, we will say that 23 is a reduced cover class 
with factor C for M. 

Here is an example. Lebesgue measure C on R is defined (p. 140) using 
the class of all intervals [a, h). The semi-dyadic intervals are sets of the 
form [(fc — l)/2 n ,(fc + l)/2"), where n £ Z, k £ Z. The class of all semi- 
dyadic intervals is a reduced cover class with factor 4 for Lebesgue measure. 
Indeed, if a < b, let n be the integer with 2~ n ~ 1 < b — a < 2~ n and k the 
integer with k — 1 < a/2 n < k, and compute [k/2 n , (k + l)/2 n ) A [b — a) and 
( k + l)/2 n -(k- l)/2 n < 4(6 - a). 

The dyadic net is the class of intervals of the form [k/2 n , (k + 1) /2 n ). It is 
not a reduced cover class by itself. Let 'A n consist of the finite disjoint unions 
of dyadic intervals [k/2 n , (k+ l)/2 ra ) with denominator 2 n . The dyadic ring 
is Ik = (J„ 3l„. 

Proposition 5.2.4. Using the set function c: 1R — > [0,oo) defined by c (E) = 
C{E), the dyadic ring 2 1 is a reduced cover class for Lebesgue measure. 
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Proof. Let a < b and let e > 0. Let n £ N be so large that 2~ n < e/2. Let 
j £ Z be such that j < 2 n a < j + 1, and m > j such that to < 2 n b < m + 1. 
Then 


i,b)CE:= (J 

k—j 


k k + 1 


and 


£{E) ~{b-a)<£ 



m + 1 


< 


< e. 


□ 


Measurable Sets 

Let JVC be an outer measure on a set X. A set A C X is JVC -measurable (in 
the sense of Caratheodory) iff JVC (E) = JVC (E fl A) + JVC(£i \ A) for all sets 
E Cl 

Theorem 5.2.5. The collection T of JA-measurable sets is a a-algebra, and 
JVC is countably additive on T. 

Proof. First, 0 £ T since for any E, we have JVC(if D 0) + JVC (E \ 0) = 
JVC(0) + JVt(£J) = JVC(iJ). It is also easy to see that a set A belongs to T if and 
only if its complement X \ A does. 

Suppose Aj £ T for j = 1, 2, • • • . Let E be any test set. Then 

M (E) = JVC (E D Ai) + M (E \ A x ) 

= M (E n Ai) + M((£ \ Ai) n A 2 ) + JVC (E \ (Ai U A 2 )) 


j - 1 


= E M +JVC £\LU 


3 — 1 


Hence 


i=i 


3 - 1 


3 = 1 


JVC(-EJ) >J2 M [ U\ U A A nA i + M U A i ’ 


3 = 1 


so (let k — > oo) 




3 - 1 


Jvc (A) >£JYt(J^U^n^)+JvM^\lM 


3 ■ 


i = 1 


But 


J-i 


^ n lK = U maIM 

jeN jeN \ V 2=1 / / 
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SO 


< M 

IMi 

V 

je n / 

oo / 

/ 3-1 

<£m 

£ \U 

3 = 1 V 

V i= 1 

<M(E). 



16 N 


' u i 

jGN 


Thus (J Aj £ 3\ This completes the proof that T is a cr-algebra. 

Now if the sets Aj £ T are disjoint, we can let E = [JAj in the previous 
computation, and we get 


View / 3=1 

so M is countably additive on T. □ 

We will write simply M for the restriction of M to the cr-algebra T of mea- 
surable sets. It is a measure on 3". Thus we see that we have a generalization 
of Lebesgue measure as constructed in Sect. 5.1. 

Corollary 5.2.6. Every Borel set in R is Lebesgue measurable. 

Proof. Open sets are measurable by Theorem 5.1.12. The collection of mea- 
surable sets is a cr-algebra by Theorem 5.1.14. □ 


5.3 Two-Dimensional Lebesgue Measure 

We next define two-dimensional Lebesgue measure. This is a measure defined 
for subsets of the plane R 2 . 

A rectangle in R 2 is a set R of the form 

R = [a, b) x [c, d) = { (x, y) £ R 2 : a < x < b,c < y < d} 

for some a < b and c < d. The area of this rectangle is c (R) = (b — a)(d — c), 
as usual. In particular, if a = b or c = d, we see that c(0) = 0. Two- 
dimensional Lebesgue outer measure is the outer measure C~ on R 2 
defined by method I from this function c. 

2 

Theorem 5.3.1. Two-dimensional Lebesgue outer measure CP is a metric 
outer measure. 
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2 

Proof. Suppose A and B are sets with positive separation. Since C is an outer 
2 2 2 

measure, we have C (A U B) < C (A) + C ( B ). So I must prove the opposite 
inequality. 

Let D be a cover of AUB by rectangles. Now a rectangle R = [a, b) x [c, d) 
can be written as a union of the four rectangles 

[a, (a + b)/ 2) x [c, (c + d)/ 2) 

[a, (a + b)/2) x [(c + d)/2, d) 

[(a + b)/2, b) x [c, (c + d)/2) 

[(a + 6)/2, 6) x [(c+ d)/2,d ) , 

and the area of the large rectangle is the sum of the areas of the four small 
rectangles. So the sum 

E < R ) 

rgd 

is unchanged when we replace one of the rectangles by its four parts. Applying 
this repeatedly, we may assume that the diameters of the rectangles in T> are 
all smaller than dist(A,.B). Then no rectangle of T> intersects both A and B. 
So ® is a disjoint union of two families, A and 23 , where A covers A and 23 
covers B. But then 

E = E < R ) + E < R ) ^ Z ^ A ) + Z ^ B )- 

He® R£A RG'B 

So we conclude that £ 2 (A UB) > £ 2 (A) + C(B). □ 

— 2 

The sets that are measurable in the sense of Caratheodory for C~ are again 

2 

called the Lebesgue measurable sets ; the restriction of £~ to this cx-algebra 
is called two-dimensional Lebesgue measure. We will write C 2 for two- 
dimensional Lebesgue measure. 

The fact that two-dimensional Lebesgue measure is not identically zero is 
left to you: 

Exercise 5.3.2. If a < b and c < d, then a rectangle of the form R = [a, b ) x 
[c, d) is Lebesgue measurable and C 2 {R) = (b — a)(d — c). 

Now that we know that the Lebesgue measure of a square is what it should 
be, the usual scheme of approximating an area with a lot of little squares will 
show that the usual sets of Euclidean plane geometry have two-dimensional 
Lebesgue measure equal to their areas. In particular, a rectangle with sides 
not parallel to the coordinate axes has the right area. This should be enough 
to prove: 

Exercise 5.3.3. Let /: R 2 — > R 2 be a similarity with ratio r. If A C R 2 is 
Lebesgue measurable, then so is f[A], and C 2 (f[A]) = r 2 C 2 (A). 
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This will tell us something about the similarity dimension of a set ACM 2 , 
as in the one-dimensional case (Exercise 5.1.18). 

Exercise 5.3.4. Let (?r,r 2 ,--- ,r n ) be a contracting ratio list; let s be its 
sim- value; let (/i, / 2 , • • • /„) be a corresponding iterated function system in 
R 2 ; and let A C R 2 be a nonempty Borel set. Suppose C 2 (fj[A] D fk[A)) = 0 
for j ^ k, and A = (J" =1 /y[A]. If 0 < £ 2 (A) < oo, then s = 2. 

This result can be used for complex number systems in the same way as 
the corresponding result was used for real number systems in 5.1.18 

Exercise 5.3.5. Let b be a complex number, and let D be a finite set of 
complex numbers, including 0. Suppose D has k elements. Suppose every 
complex number can be represented in the number system defined by base b 
and digit set D, and the set of complex numbers with multiple representations 
has two-dimensional Lebesgue measure 0. What does this mean about the 
relationship between b and kl 


Higher Dimensions 

Let d be a positive integer. In d-dimensional Euclidean space R d , we will 
consider hyper-rectangles of the form 


R= [ai, bi) x [a 2 ,b 2 ) x x [a d ,b d ), 


where ay < bj for all j. The “hyper- volume” of this hyper-rectangle R is 

d 

C (R) = Y[( b o ~ a j)- 
1=1 

We define d-dimensional Lebesgue outer measure to be the method I 
outer measure defined from this set function c. We define d-dimension- 
al Lebesgue measure to be the restriction to the measurable subsets. As 
before, we use the notation C and C d . 

Exercise 5.3.6. The outer measure C d is a metric outer measure. If 


R= [ai, bi) x [02,62) x ••• x [a d: b d ), 


where ay < bj for all j, then 


C d (R) = n (bj 
1=1 


Uj). 
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Dyadic Cubes 

A semi-dyadic square in R 2 is a set of the form 
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' j- 1 ttl\ x 
2 n ’ 2" / 


k — 1 fc + 1 


Exercise 5.3.7. Show that the class of semi-dyadic squares is a reduced cover 
class with factor 8 for C 2 . 


A dyadic square in R 2 is a set of the form 


'jL i±l) x 

2 n ’ 2 n ) 


k k + 1\ 

2 n ’ 2 n ) ' 


For each n, let Jl n be the set of finite disjoint unions of dyadic squares with 
denominator 2 n . Then 31 = (J n CR n is called the dyadic ring in R 2 . 

Exercise 5.3.8. Using the set function c: — > [0,oo) defined by c(E) = 
C 2 (E), the dyadic ring R is a reduced cover class for 2-dimensional Lebesgue 
measure. 


After you have completed the preceding two exercises, it should be easy to 
formulate the corresponding results for d-dimensional Lebesgue measure for 
any d £ N. You would define semi-dyadic cubes, dyadic cubes, and the dyadic 
ring in R d . 


5.4 Metric Outer Measure 

Consider the following example of a method I outer measure on R; the defini- 
tion is very close to the definition used for Lebesgue measure. We begin with 
the collection A = { [a, b) : a < b } of half-open intervals and the set func- 
tion c([a, 6)) = \Jb — a. Let M be the corresponding method I outer measure. 
I claim that the interval A = [0, 1] is not measurable. 

Consider the measure of [0, 1). Certainly the singleton {[0, 1)} covers [0, 1), 
so M([0,1)) < c([0, 1)) = 1. If [0,1) C [J ieH [aj, 6j), then by what we know 
about Lebesgue measure, we must have ~ a*) > 1- So we have also 

( oo \ 2 oo 2 

y/bj - at J = y (y/bj - Qj) +2 s ^ ^/b i -a i ^ /bj - aj 

i — 1 / i = 1 i<j 

oo 

> - cti) > i- 

i— 1 


Therefore Vh — ai > 1. This shows that M([0,1)) > 1. So we have 

M([0,1)) =1. 
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Similarly M([— 1,0)) = 1. The singleton {[—1,1)} covers [—1,1), so as 
before we have M([— 1, 1)) < c([— 1, 1)) = \/2. So if A = [0, 1] and E = [— 1, 1), 


we have 


M {E n A) + M (E \A) = 1 + 1 = 2>V2> M (E). 


This shows that A = [0, 1] is not measurable. 

It is often desirable that the sets we work with be measurable sets. When 
we work with subsets of a metric space (as is common in this book), the sets 
are often open sets, closed sets, or sets constructed simply from open and 
closed sets. In particular, the sets are often Borel sets. There is a condition 
that will insure that all Borel sets are measurable sets. 

Two sets A, B in a metric space have positive separation iff dist(^4, B) > 
0; that is, there is r > 0 with g(x,y) > r for all x £ A and y £ B. Let M 
be an outer measure on a metric space S. We say that M is a metric outer 
measure iff M(^4 U B) = M(^4) + M(B) for any pair A,B of sets with 
positive separation. (Theorem 5.1.6 shows that £ is a metric outer measure.) 
The measure M obtained by restricting a metric outer measure M to its 
measurable sets will be called a metric measure. 

The reason that metric outer measures are of interest is that open sets (and 
therefore all Borel sets) are measurable sets. Before I prove this, I formulate 
the lemma of Caratheodory. 

Lemma 5.4.1. Let M be a metric outer measure on the metric space S. Let 
Ai C A 2 C • • • , and A = U jG n Ar Assume dist(A,-,A \ A,-+i) > 0 for all j. 
Then M(A) = lim^oo M(A,). 

Proof. For all j we have M(^4) > M(A,), so M(A) > linx^oo JA{Aj). (This 
inequality is true for any outer measure.) If linij^oo M(Aj) = 00, then the 
equation is true. So suppose lim^oo M(A,) < 00. 

Let B\ = A\ and Bj = Aj \ Aj_ 1 for j > 2. If i > j + 2, then Bj C Aj 
and Bi C A \ Ai_\ C4 \ Aj+ 1 , so and Bj have positive separation. So 



Since lim^oo M(Aj) < 00 , both of these converge (as m — > 00 ). So 



OO 


<M(^)+ £ M (B k ) 


k—j + 1 
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OO 

< lim M(A0 + V M (B k ). 

i — ►oo ' ^ 

k=j + l 

Now as j — * oo, the tail of a convergent series goes to 0, so we get 

M(T) < lim □ 

i — >oo 


Theorem 5.4.2. Let M be a metric outer measure on a metric space S. Then 
every Borel subset of S is JA-measurable. 

Proof. Since the cr-algebra of Borel sets is the cr-algebra generated by the 
closed sets, and since the collection IF of measurable sets is a cr-algebra, it is 
enough to show that every closed set F is measurable. Let A be any test set. I 
must show that M(A) > M(j4 fl F) + M(A \ F), since the opposite inequality 
is true for any outer measure. 

Let Aj = { x £ A : dist(x, F) > 1/j }. Then dist(A,-, F fl A) > 1/j, so 

M (A n F) + M (Aj) = M((T nf)U Aj) < M(A). (1) 

Now since F is closed, F contains all points of distance 0 from F, so A \ F = 
UjeN Aj ■ We check the condition of the lemma: If x € {A \ (F U Aj + 1 )) , then 
there exists z G F with g(x, z ) < 1/ (j + 1). If y £ Aj , then 

g{x,y) > g{y,z) - g(x,z) > - - - 

J J + 1 

Thus 

dist (4\(FU Aj + i),Aj) > - — . ^ ^ > 0. 

Therefore, applying the lemma, we get M(A\L 1 ) < lim^oo M (Aj). Taking 
the limit in (1), we get M(T fl F) + M(A \ F) < M(T), which completes the 
proof. □ 


Proposition 5.4.3. Let M be a finite metric measure on a compact metric 
space S . Let E C S be a Borel set. For any e > 0, there exist a compact set 
K and an open set U with U A E D K and M(U \ K) < e. 


Proof. Let A be the collection of all sets ECS such that for all e > 0, there 
exist a compact set K and an open set U with U A E D K and M.(U\K) < e. 

First I claim that all closed sets belong to A. Let F C S be closed and 
e > 0. Now 


XJ n = \ x £ S : dist(x,F) < 


defines open sets U n with U\ D C/ 2 A ■ ■ ■ and flngN = F. So we have 
lim^oo M(U n ) = M(F). There is n so large that M(U n ) — M(F) < e. Then: 
U n C F D F, U n is open, F is compact, and M (U n \ F) < e. 
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Clearly 0 £ A. 

Next, A is closed under complements. Let E £ A. Consider the comple- 
ment E' = S \ E. Let £ > 0. There is an open U and a compact K with 
U A E D K and M (U \ K) < e. But then U' = S\U is compact, K' = S\K 
is open, K’ D E' D U 1 , and M (K’ \ U') = M (U \ K) < e. 

Now I claim A is closed under countable unions. Let E n £ A for n £ N. 
Write E = \J ne ^E n , and let e > 0. Then there exist open XJ n and compact 
K n with U n D E n D K n and M (U n \ K n ) < e/2 n+1 . Then U = (J nGN U n is 
open. Now 

m 

Lm = 

n— 1 

is compact, increases with m, and UmeN = U n eN ^ n • There is m so large 

that 

M ( 1J K n j -M (L m ) < £ ~. 

Vn6N / 

So we have U 0 E D L n and 


M{U\L n ) <M ( (J (U n \ K n ) j +M ( (J K n ) -M (L m ) 

VnGN / VnGN / 


< 


E 


£ £ 

+ — = £ 

2 n +i T 2 


Therefore A includes at least the Borel sets. 


□ 


Method II 

We have seen that method I may fail to yield a measure where open sets 
are measurable. There is a related construction, called “method II” that will 
overcome this difficulty. 

Let A be a family of subsets of a metric space S, and suppose, for every 
x £ S and e > 0, there exists A £ A with x £ A and diamA < e. Suppose 
c: A — > [0, oo ] is a given function. An outer measure will be constructed based 
on this data. For each £ > 0, let 

A e = { A £ A : diam A < e } . 

Let M e be the method I outer measure determined by c using the family A e . 
Then by Proposition 5.2.3(a), for a given set E , when e decreases, M e (E) 
increases. Define 

M (E) = lim M e (E) = supM e (-E). 

£>0 

It is easily verified that M is an outer measure. As usual, we will write M for 
the restriction to the measurable sets. This construction of an outer measure 
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M from a set function c (and a measure M from M) is called method II. It is 
more complicated than method I, but (unlike method I) it insures that Borel 
sets are measurable: 

Theorem 5.4.4. The set function DVC defined by method II is a metric outer 
measure. 

Proof. Let A,BCS with dist(^4, B) > 0. Since M is an outer measure, we 
have M(A UB) < M(T) + M (B). So I must prove the opposite inequality. 

Let e > 0 so small that e < dist(T, B). Let T> be any countable cover of 
A U B by sets of A e . The sets D £ T) have diameter less than dist(A, B ), so 
such a set D intersects at most one of the sets A,B. Therefore, T> may be 
divided into two disjoint collections, T>i and I> 2 , where T>i covers A and T> 2 
covers B. Then 

£ = £ c (D) + £ c(D) >M E (A)+M e {B). 

De D DgB 1 De d 2 

Now we may take the infimum over all covers, and conclude M e (A U B) > 
M e (A)+M e (B). Then we may take the limit as e — > 0 to conclude M(TUU) > 
M(A) + M(B). ' □ 

Exercise 5.4.5. Let S' be a metric space, and let c be a set function. For a 
collection of sets A, let M a be the method II outer measure defined using the 
class A of sets and the restriction of c to A. 

(a) If 3 C A, then Ma < M®. 

(b) Suppose that, for every 77 > 0 there is 6 > 0 such that for all A £ A with 
diamT < <5, and every e > 0, there is B £ 23 with diaml? < 77 , B D A, 
and c (B) < c(A) + e. Then < JA-a- 

(c) Let C > 0 be a constant, and suppose that, for every 77 > 0 there is S > 0 
such that for all A £ A with diam A < S , there is B £ 3 with diam B < 77 , 
BAA, and c(B ) < Cc(A). Then M® < CM. a- 

When condition (b) holds, we will say that CB is a reduced cover class 
for M. When condition (c) holds, we will say that 3 is a reduced cover class 
with factor C for JVC. 


5.5 Measures for Strings 

One of the useful ways we will employ the material of this chapter is by 
defining measures. Sometimes (for example Lebesgue measure or Hausdorff 
measure) we will define a measure on subsets of Euclidean space R d . But also 
measures will be defined on our string models and path models. 
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Ail Example 


We will consider an easy example before we attack the more general case. 

Begin with the two-letter alphabet E = {0,1}- Consider, as usual, the 
metric space of infinite strings with metric £> 1/2 ■ We will construct a 
measure on E^\ We begin with the family of “basic open sets”: 

A = { [a] : a £ E& } 


together with the set function c: A — > [0,oo) defined by 


c(W) 


1 

2H ' 


(Recall the notation |a| for the length of the string a.) 

Proposition 5.5.1. The method I outer measure Mp 2 constructed using this 
function c is a metric outer measure and satisfies M 02 ([a]) = c([a]) for all 
a £ £(*). 


Proof. Write A = { [a] : a £ E^ }, A e = {D £ A : diamD < e}. Let N e 
be the method I measure defined by the set function c restricted to A e . If 
D £ A e , then of course c{D) > M 1 / 2 (L ) ), so by the Method I theorem, 


W) > M 1/2 (A) 


for all A. Therefore the method II measure N defined by 

N(A) = limN e (A) 


satisfies N(A) > M 1 / 2 (^4)- 

Note that, for any a £ E^*\ if k is the length |a|, then we have 

<=([“]) = ^ = + '<[«»])+ '«“!])• 

Applying this repeatedly, we see that, for any e > 0, any set D £ A is a finite 
disjoint union D\ U D 2 U • • • U D n of sets in A e with c (D) = ^c(JDj). This 
means that 3sf e (D) < c (£>), so by the Method I theorem, N e (A) < Mi/ 2 (A) 
for all sets A. So N(A) < Mi/ 2 (A) 

Therefore M 1/2 = N is a method II outer measure, so it is a metric outer 
measure. □ 

Exercise 5.5.2. Let h: E^> — > R. be the “base 2” addressing function defined 
on p. 14. If A C E^ is a Borel set, then 3Vt \/ 2 {A) = C(h[A]). 
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Measures on String Spaces 

Let E be a finite alphabet with at least two letters. Consider the space E 
of infinite strings. This space is a metric space for many different metrics 
g. But all of the metrics constructed according to the scheme in Proposi- 
tion 2.6.5 produce the same open sets. A countable base for the open sets is 
{ [a] : a £ }. Also is a compact ultrametric space. An important 

feature of all these metrics is lirn^oo diam[<r|"A;] = 0 for each a £ E^ u \ 

Exercise 5.5.3. It follows that lim^^ (sup { diam[a] : a £ EW }) = 0. 

Suppose a non-negative number w a is given for each finite string a. Under 
what conditions is there a metric outer measure M on with M([a]) = w a 
for all a? Since it is a metric outer measure, the open sets [a] are measurable, 
so M is additive on them. Now the set [a] is the disjoint union of the sets [/3] 
as / 3 ranges over the children of a (that is, /3 = ae for e £ E). 

Theorem 5.5.4. Suppose the non-negative numbers w a satisfy 

w a =^2 ®«e 

eG-E 


for a £ E M . Then the method I outer measure defined by the set function 
c([a]) = w a is a metric outer measure M on E^ with M([a]) = w a . 

Proof. Write A = { [a] : a £ E^ }, A e = {D £ A : diam D < e}. Let N e 
be the method I measure defined by the set function c restricted to A e . If 
D £ A e , then of course c (D) > M (D), so by the Method I theorem, 

N e (A) >M(A) 

for all A. Therefore the method II measure N defined by 

N(A) = limN e (A) 

£—>0 


satisfies N(A) > M(A). 

Note that, for any a £ E^*\ we have 

C (H) =W a = ^2 W <*e = ^2 C (M)‘ 
eG E eG E 

Applying this repeatedly, together with Exercise 5.5.3, we see that, for any 
e > 0, any set D £ A is a finite disjoint union D\ U £>2 U • • • U D n of sets in 
A e with c(D) = ^c(Di). This means that N e (H) < c(D), so by the Method 
I theorem, N E (A) < M(A) for all sets A. So N(A) < M(A). 

Therefore M = N is a method II outer measure. So we may conclude that 
it is a metric outer measure. □ 
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How should we formulate the corresponding theorem for the path spaces 
Ev ^ defined by a directed multigraph (V, E, i, t)l We will define measures for 
each of the spaces Ev . We only need to define one of them at a time. 

Fix a vertex v. Suppose nonnegative numbers w a are given, one for each 
a £ E^*\ They should (of course) satisfy 

W a = ^2 W ae 

t(e)=i(a) 

for a £ E), ’ . Note that this has consequences for the troublesome exceptional 
cases that came up when we were defining the metric. If a has no children, 
then (interpreting an empty sum as 0), we see that w a = 0. Similarly, if a has 
only one child f3, then w a = wp. 

Exercise 5.5.5. Suppose the non-negative numbers w a satisfy 

w a = ^2 W <*e 

i(e)=t(a) 

for a £ Ey . Then the method I outer measure defined by the set function 
c([a]) = w a is a metric outer measure M on with M([a]) = w a . 

5.6 * Remarks 

Henri Lebesgue’s measure and integration theory dates from about 1900. It 
is one of the cornerstones of twentieth century mathematics. I have not dis- 
cussed integration at all, in order to reduce the amount of material to its bare 
minimum. (My more advanced text [18] develops integration using the same 
ideas.) The more abstract measure theory was developed by many others, 
such as Constantin Caratheodory, during the early 1900’s. Theorem 5.5.4 
on the existence of measures on the string spaces is due essentially to A. N. 
Kolmogorov. 

The cr-algebra T generated by a family T> of sets (as in Theorem 5.2.1) is 
a complicated object to describe constructively. The proof given for 5.2.1 has 
the advantage of not requiring such a constructive description. Certainly T 
contains all countable unions 

Ua 

jGN 

with Di £ CD; it contains complements of those unions; it contains countable 
intersections 

O* 

ie n 

where each Ei is either a countable union or a complement of a countable 
union. But that may not be everything in 3 r . (See, for example, the proof of 
Theorem (10.23) in [34].) 
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An example of a set in the line not measurable for Lebesgue measure may 
be found in many texts. For example: [7, pp. 36-37], [11, Theorem 1.4.7], [34, 
(10.28)], or [58, Chap. 3, Sect. 4], 

Exercise 5.3.5: k = |6| 2 . 

Exercise 5.5.2. Both measures are method I measures; use the Method I 
theorem twice, once to prove an inequality in each direction. 


0, wiste a man how manye maladyes 
Folwen of excesse and of glotonyes 
He wolde been the moore mesurable 
— G. Chaucer, The Pardoner's Tale 
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Fractal Dimension 


Next we come to the fractal dimensions: in particular the Hausdorff dimension 
and the packing dimension. The surprising feature for these dimensions is that 
they need not be integers: they can be fractions. The Hausdorff dimension is 
the one singled out by Mandelbrot when he defined “fractal’.’ The Hausdorff 
and packing dimensions are perhaps a bit more difficult to define than some 
of the other kinds of fractal dimension. But in recent years it has become clear 
that they are the most useful of the fractal dimensions. 


6.1 Hausdorff Measure 

Let S' be a metric space. Consider a positive real number s, the candidate 
for the dimension. The s-dimensional Hausdorff outer measure is the 
method II outer measure defined from the set function c s (A) = (diamA) s . It 
is written TC . The restriction to the measurable sets is called s-dimensional 
Hausdorff measure , and written !K S . Since Jf is constructed by method II, 
it is a metric outer measure. So all Borel sets are measurable (in particular, 
all open sets, closed sets, compact sets). 

Recall that the Method I theorem gives a more explicit construction: A 
family A of subsets of S is called a countable cover of a set F iff 

FC U A, 

AeA 

and A is a countable (possibly even finite) family of sets. Let e be a positive 
number (presumably very small). The cover A is an e-cover iff diamA < e 
for all A € A. Define 

J~C^(F) = inf (diamA) 5 , 

AeA 

where the infimum is over all countable e-covers A of the set F. (By conven- 
tion, inf 0 = oo.) A computation shows that when e gets smaller, J~C e (F) gets 
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larger. Finally: 

(E) = limff{*(E) = supW*(F) 

E ^° £>0 

is the s-dimensional Hausdorff outer measure of the set F. Figures 6.1.1 and 
6.1.2 illustrate some of the ideas behind the definition. 

There are variants in the definition of the Hausdorff measure that are 
sometimes useful, (i) Since the closure of a set has the same diameter as the 
set itself, we may use only closed sets in the covers A. The class of closed sets 
is a (method II) reduced cover class for 3~C S . (ii) If A is any set, it is contained 
in an open set with diameter as close as I like to the diameter of A. The class 
of open sets is a reduced cover class for 0~C S . (iii) Any set of diameter r is 
contained in a closed ball of radius r (and diameter < 2r) . The collection of 
open balls is a reduced cover class with factor 2 s for !H S . (iv) In Euclidean 
space R d , the convex hull of any set has the same diameter as the set. The 



Fig. 6.1.1. The Hausdorff measure (area) of a piece of surface A is approximated 
by the cross-sections of little balls which cover it. (From [52]) 



Fig. 6.1.2. One must cover by small sets to compute length accurately. Here the 
length of the spiral is well-estimated by the sum of the diameters of the tiny balls, 
but grossly under-estimated by the diameter of the huge ball. (From [52]) 
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collection of convex sets is a reduced cover class for 3~C S . (v) If a set K is 
compact, then every open cover of K has a finite subcover, so to compute the 
Hausdorff measure of a compact set K, we may use finite covers A. (vi) If we 
replace a set in a cover A of a set F by a subset of itself, so that the result is 
still a cover of F, the sum 

^](diamA) s 

Ae-A 

only becomes smaller. So if F C T C S, the value of !K*(F) when F is 
considered to be a subset of T is the same as when F is considered to be a 
subset of S. In particular, we may assume (if it is convenient) that the sets 
used in the covers A of the set F are subsets of F. 

Exercise 6.1.3. If F is a finite set, then 1K S (F) = 0 for all s > 0. 

Theorem 6.1.4. In the metric space M, the one- dimensional Hausdorff mea- 
sure IK 1 coincides with the Lehesgue measure £. 

Proof. If A C M has finite diameter r, then sup A— inf A = r, so A is contained 
in a closed interval I with length r, and £(A) < £(I) = r. But by the Method I 

theorem (5.2.2), IK* is the largest outer measure M satisfying M(A) < diam A 
for all sets A with diameter less than e. So !K e (F) > £(F) for all F. Therefore 
H.\f) >£(F). 

Now if [a, b) is a half-open interval and e > 0, we may find points a = 
Xq < Xi < ■ ■ ■ < x n = b with Xj — Xj - 1 < £ for all j. Then [a, b) is covered by 
the countable collection { [xj-i,Xj] : 1 < j <n} 7 and 

n n 

dia.m[xj-i,Xj] = — Xj_ i) —b — a. 

1=1 1=1 

Therefore fK^([a, &)) < b — a. But by the Method I theorem, £ is the largest 
outer measure satisfying C(\ L a 1 6)) < b — a for all half-open intervals [a, b). 
Therefore £(F) > df 1 (E). for all F. 

The two outer measures £ and !K coincide. The measurable sets in each 
case are given by the criterion of Caratheodory, so the measures £ and J-C 1 
also coincide. □ 

For a “zero-dimensional” Hausdorff measure, we can use the set function 
c 0 defined by c 0 (H) = 1 for Aff0 and c o (0) = 0. 

Exercise 6.1.5. With this definition, 3f 0 (H) = n if A has n elements, and 
3~C°(A) = oo if A is infinite. 

Hausdorff Dimension 

How does the Hausdorff measure 3f s (F) behave as a function of s for a given 
set FI An easy calculation shows that when s increases, df s (F) decreases. 
But much more is true. 
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Theorem 6.1.6. Let F be a Borel set. Let 0 < s < t. If !K S (F) < oo, then 
J<f(F) = 0. If > 0, then 1K S (F) = oo. 

Proof. If diam A < e, then 

tK e (A) < (diamH)* < e t_s (diam T) s . 

Therefore by the Method I theorem, S CFC^ (/ 7 ’) for all F. Now if 

3< S (F) < oo, then J£*(F) < lim e ^ 0 £ t_s 5f*(F) = 0 x X S (F) = 0. The second 
assertion is the contrapositive. □ 

This means that, for a given set F, there is a unique “critical value” 
s 0 £ [0, oo] such that: 


tK s (F) = oo 
M S (F) = 0 


for all s < so; 
for all s > sq. 


This value so is called the Hausdorff dimension of the set F. We will write 
so = dimF. Of course, it is possible that !H S (F) = 0 for all s > 0; in that case 
dim F = 0. In the same way, it is possible that !K S (F) = oo for all s; in that 
case dim F = oo. 

This idea of dimension is an abstraction of what we already know from 
elementary geometry. If A is a nice smooth rectifiable curve, then its length 
is a useful way to measure its size; but its “area” and “volume” are 0. The 
dimensions 2 and 3 are too large to help in measuring the size of A. If B is 
the surface of a sphere, then its area is positive and finite. We can say its 
“length” is infinite (for example, since it contains curves that are as long as 
we like which spiral around); its “volume” is 0, since it is contained in a solid 
shperical shell whose thickness is as small as we like. So for the set B 1 the 
dimension 1 is too small, the dimension 3 is too large, and the dimension 2 is 
just right. The s-dimensional Hausdorff measure give us a way of measuring 
the size of a set for dimensions s other than the integers 1, 2, 3, • • • . 

Theorem 6.1.7. Let A,B be Borel sets. 

(1) If AC. B, then dim A < dim B . 

(2) dim(H U B) = maxjdim A, dim B} . 

Proof. (1) Suppose A C B. If s > dim!?, then r K s {A) < 3~C S (B) = 0. Therefore 
dimH < s. This is true for all s > dimS, so dimH < dim B. 

(2) Let s > maxjdimA, dim.B}. Then s > dimH, so 3~C S (A) = 0. Similarly, 
3< S (B) = 0. Then df s (HUH) < (A) + (B) =0. Therefore dim(HUF) < 

s. This is true for all s > maxjdimA, dim 5}, so we have dim(H U B) < 
max{dimH,dimH}. By (1), dim(H U B) > max{dimH, dimH}. □ 

Exercise 6.1.8. Suppose Ai, A 2 , ■ ■ ■ are Borel sets. Is it true that 


dim |^J Ak = supdimHfc? 
fee N k 
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Theorem 6.1.9. Let f:S^>T be a similarity with ratio r > 0, let s be a 
positive real number, and let F C S be a set. Then Di ( f\F 1) = r s 0~C (F). So 
dim f[F\ = dim F. 

Proof. We may assume that T = f[S]. Then / has an inverse / _1 . A set 
ACS satisfies diam f[A\ = rdiamA. Therefore (diam/[A]) s = r s (diamA) s . 
By the Method I theorem (applied twice), TC s re (f[F]) = r s Jfg(F). Therefore 
J{ S (/[F]) = r s 5?(F) and dim/[F] = dimF. " □ 

Exercise 6.1.10. Suppose /: S' — > T is a function. Let A C S be a Borel set. 
Prove or disprove: (1) If / is Lipschitz, then 

dim f[A } < dim A. 

(2) If / is inverse Lipschitz, then 

dim f[A } > dim A. 

Exercise 6.1.11. Suppose S is a metric space and dimS < oo. Does it follow 
that S is separable? 


6.2 Packing Measure 

In this section we define the packing measures and the packing dimension. 

Mandelbrot says that his definition for “fractal” (Cov S < dim S) is too 
broad, in that it admits “true geometric chaos’.’ The sets that are of interest 
for applications (and in mathematics) are generally not the most general set, 
with few special properties. So it may be useful to restrict the term “fractal” 
so that the sets meeting the conditions have useful properties. One possible 
way to do this has been proposed by James Taylor. He proposed to apply the 
term “fractal” to (Borel) sets where the packing dimension is equal to the 
Hausdorff dimension. 


Motivations 

Before we formulate the definition of the packing measures, let us discuss some 
of the reasons for the definition, and why it has the form given. 

Hausdorff measure is based on “covering” of a set. The set E to be mea- 
sured is covered by small sets Aj. We attempt to make the covering “efficient” 
by minimizing 

ieN 

subject to the constraint that the sets A; cover E. When this sum is smaller, 
the cover of E by {A^} is considered to be more efficient. 
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Another possibility for “measuring” the set E is to do it by packing rather 
than covering. We want to put disjoint sets Aj inside E. We attempt to make 
this packing “efficient” by maximizing 

£GN 

subject to the constraint that the sets Aj are disjoint subsets of E. When this 
sum is larger, the packing {Aj} is considered to be more efficient. 

For fractal measures, the set function c should be of the form (diamA) s , 
where s > 0 is the dimension we are interested in. But this leads to certain 
undesirable features if taken at face value. For example, in the plane, what if 
we pack a square as in Fig. 6.2.1(a)? By making the sets Aj narrow enough, 
we can make the sum 

^(diamAj) 8 , 

iGN 

as large as we like. 

The way to avoid this is to pack only by sets of a special type. For example, 
in K, packings with intervals cannot be beat. In Euclidean space, the choice is 
often to pack with cubes. In order to get a definition that applies in a general 
metric space, we will pack with balls. 

Packing a set E with balls Aj C E is fine when E is an open set, but other 
sets may contain no balls at all. So we drop the requirement that the balls be 
contained in E. But to make sure the packing measures the set E we require 
instead that the centers of the balls lie in E. 

Let S' be a metric space, x £ S and r > 0. Recall the notation 

B r (x) = {y € S : g{x, y) < r } , B r (x) = {y £ S : g(x, y) < r } . 

We will pack with closed balls. But open balls could be used just as well in 
our setting. 

For two balls B r (x), B s (y) in Euclidean space, we know that they are 
disjoint, B r (x) D B s (y ) = 0, if and only if g(x,y) > r + s. In metric space 
other than Euclidean space, this equivalence may fail. But we do know that 
if g(x, y) > r + s, then B r (x) D B s (y) = 0. We will use g(x, y) > r + s for our 
definition of “packing’.’ 



Fig. 6.2.1. (a) Packing with any sets (b) packing with balls 
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In Euclidean space, two balls are equal, B r {x) = B s (y) : if and only if x = y 
and r = s. In metric space other than Euclidean space, this equivalence may 
fail. For example, in an ultrametric space, every point of a ball is a center. This 
is a reason for our use of “constituents” rather than balls in the definition. 

In Euclidean space, the diameter of the ball B r (x ) is 2r. In metric space 
other than Euclidean space, this may not be true. For example, in an ul- 
trametric space, &\am.B r (x) < r. We will use the set function (2r) s for our 
“radius-based” packing measure rather than the “diameter-based” option of 
(diamB r .(a;)) s . 

In some texts- including the first edition of this one — one or more of the 
above choices may be reversed. As noted, in Euclidean space this makes no 
difference. 


Definition 

Let S' be a metric space. A constituent in S is a pair (x,r), where x £ S 
and r > 0. We think of the constituent (x,r) as standing for the closed ball 
B r (x) . We may even call x the “center” and r the “radius” of the constituent 
(x,r). 

Let ECS. A packing of El is a countable collection 77 of constituents, 
such that: (a) for all ( x,r ) £ 77, we have x & E; (b) for all (x,r),(y,s) £ 77 
with (x, r ) ytz (y, s), we have g(x, y) > r + s. 

For 6 > 0, we say that a packing 77 is 5-fine iff for all (x, r) £ 77 we have 
r < 5. Let F C S, and let 6, s > 0. Define 

r S (F) = sup E (2 r) s , 

{x,r)^II 

where the supremum is over all (5-fine packings 77 of F. Note: because of the 
sup involved, we may restrict this to finite packings 77 only. 

When 5 decreases to 0, the value T|(F) decreases, so we define 

W = lim 9|(F) = inf 92(F). 

o — *0 d >0 

When we have done this, we get a family (Tg) of set functions indexed by s. 
As before, there is a critical value: 

Exercise 6.2.2. Let F be a set in a metric space. There is s o £ [0,oo] such 
that 


= 00 f° r s < so; 

fPg(F) = 0 for all s > s o- 

This critical value So will be called the packing index of the set F. 
However, the set functions Tg are not really what we want. They are not 


172 


6 Fractal Dimension 


outer measures. This is not unexpected, since the process used to construct 

~l/2 

them is not method II. Here is an illustration showing that Jg fails to be an 
outer measure on R: 

Proposition 6.2.3. Let K be the compact set {0, 1, 1/2, 1/3, 1/4, 1/5, • • • } C 
M. Then ^(K) > 0. 

Proof. Let k £ N be odd, let e = 2~ k , and let n = 2^ k ~ 1 ^ 2 . Then 


so the constituents with radius e and centers 1, 1/2, 1/3, • • • , 1/n form a pack- 
ing. (That is, the balls with radius e and centers 1, 1/2, 1/3, • • • , 1/n are dis- 
joint.) So 

T\ /2 {K) >n( 2e) 1/2 = 1, 

and therefore K ) >1. □ 

For many purposes it is unreasonable to claim that this countable set K 
has positive dimension. We know a good way (method I) to get an outer 
measure from a set function. So we apply method I to the set function Tg: 

? S (E) = inf £ T 0 (C), 
cee 

where the inf is over all countable covers 6 of the set E. 

Theorem 6.2.4 (The closure theorem). If C is a set and C is its closure, 
then 'Pg(C') = ?g(C). 

Proof. Airy packing of C is automatically ajpacking of C. This shows that 
T 5 (C) < ?f(C) for all <5 and thus Tg(C) < Tg(C). 

Conversely, let <5 > 0 and let 77 be a finite 5-fine packing of C. Write 77 = 
{(a?i, ri), • • • , (x n , r n )}. For any i ^ j, we have g(xi, Xj)~Ti~rj > 0, and there 
are only finitely many pairs i,j, so there is e > 0 with g(xi, x f) — ri — rj > £ 
for all i ^ j. Now for each i , the point Xi belongs to the closure of C , so 
there is yi G C with g(xi,yi) < e/2. But then 77' = {(j/i,ri), • • • , (y n ,r n )} is 
a packing of C, still 5-fine, and it has the same value as the packing 

77. Therefore we get TJ (C) > ( C ) for all 5 and thus Tg (C) > Tg ( C ) . □ 

The class of closed sets is a reduced cover class for !P S : 

Corollary 6.2.5. Let ECS. Then 

y\E) = inf £ T 0 (C), 
cee 

where the inf is over all countable covers C of the set E by closed sets. 


6.2 Packing Measure 173 


Lemma 6.2.6. LetA,B C S. Then B) < ^(Aj + fPj^T?). 7/dist(A, B) 

> 0, then ?g(A U B) = T 0 (A) + T 0 (B). 

Proof. Let 5 > 0 be given. Let II be a 5-fine packing of A\JB. Then II is the 
disjoint union of 

III = { (%, r) £ II : x £ A} and II 2 = { (x, r) £ II : x A } . 

But III is a 5-fine packing of A and 77 2 is a 5-fine packing of B. So 

£ (2 r) s = £ (2 rY+ £ (2 r)° <T S (A) +T S (B). 

(x,r)£ll (x,r)ElZi (x,r)Ei72 


Take the supremum over all 5-fine packings 77 to get 1P|(A U B) < 1P|(A) + 

T S (B). Let 5 -> 0 to get Tg(A U B) < Tg(A) + T 0 {B). 

Let dist(A, B) = e > 0. Then if 5 < e/2, any 5-fine packing of A U B is 
the disjoint union of a 5-fine packing of A and a 5-fine packing of B. And 
conversely, the union of a 5-fine packing of A and a 5-fine packing of B is a 
5-fine packing of A U B. So T|(A U B) = T|(A) + T|(77). Let 5 — > 0 to get 
Tg(Au73) = ?5(A) + ?g(S). □ 

Theorem 6.2.7. The set function tP is a metric outer measure. 

Proof. The only packing of the empty set is the empty packing, and an empty 
sum has the value 0. Therefore J’|(0) = 0 for all 5 > 0 and Jq( 0) = 0- 
The empty set can be covered 0 C (J ngN 7? n , where E n = 0 for all n, so 
? S (0) = 0 . 

If A C B, and B C (J neN E n , then also A C \J n&n E n , so ? S (A) < ?\B). 
Suppose A = U ieN Ai. We must show ? S (A) < If Yhi 

diverges, then there is nothing to do, so assume ^ (^i) < 00 . Let e > 0 
be given. For each i, there exist sets E nil n £ N, so that Ai C \J n E ni and 
Yf n 'Pf){E ni ) < T S (Ai) + e/2*. Then A C (J. (J n E ni is a countable cover of A , 
so 

3> S (A) < ££?g(7; m ) < £ (t s (a,) + ^) = (£ +e. 

in i \ i / 

This holds for any e > 0, so T S (A) < J2i ^ 

The metric property follows from Lemma 6.2.6. □ 

The restriction of tP to the measurable sets is a measure, called the s- 
dimensional packing measure , and written 1P S . As usual there is a critical 
value for s: 
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Exercise 6.2.8. Let F be a Borel set in a metric space. There is so £ [0,oo] 
such that 


7 S (F) = oo for all s < so; 

T S (F) = 0 for all s > so- 

This value sq is called the packing dimension of the set F. We will write 
so = DimF. It is a more reasonable quantity than the packing index defined 
above. 

Elementary Properties 

The packing dimension has many of the same properties as the Hausdorff 
dimension. 

Theorem 6.2.9. Let A, B be Borel sets. 

(1) If A C F, then Dim A < DimF. 

(2) Dim( A U B) = max{Dim A, Dim Bj . 

Proof. (1) Assume A C B. Let s > DimF. then CP S (F) = 0. Therefore 
iP s (A) = 0. This shows Dim A < s. This holds for all s > DimF, so 
Dim A < DimF. 

(2) By (1), Dim(A U F) > Dim A and Dim(A U F) > DimF. There- 
fore Dim(A U F) > max{DimA, DimF}. If s > max{Dim A, DimF}, then 
fP S (A) = 0 and fP s (F) = 0. So by subadditivity, ? S (A U F) = 0. This shows 
Dim(A U F) < s. It holds for all s > max{DimA, DimF}, so Dim(A U F) < 
max{DimA, DimF}. □ 

Exercise 6.2.10. Suppose Ai, A 2 , • • • are Borel sets. Is it true that 

Dim |^J Afc = sup Dim A*,? 
ke N k 

Theorem 6.2.11. Let f:S^T be a similarity with ratio r > 0, let s be a 
positive real number, and let E C S be a set. Then 7 (/[F]) = r s fP (F). So 
Dim/[F] = DimF. 

Proof. Let LI he & Ffine packing of F. Then { ( f(x),rt) : (x.t) £ 17 } is an 
r 5-fine packing of f[F]. So 

Ks(f[F})> E ( 2 rt Y=r s E ( 2 t Y • 

(x,t)en ( x,t)€n 

This holds for all 5-fine packings of F, so 

r rS {f[F))>r s r s (F). 
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Let S — > 0 to get 

T 0 (f[F})>r s T 0 (F). 

This holds for all subsets F C S. 

Since r > 0, the map / is one-to-one, and maps F onto f[F]. Every rS- 
fine packing of f[F] is of the form { (/(x),rf) : (x,i) £ 77} for some (5-fine 
packing of F. So the estimates hold in reverse, and we conclude 

T 0 {f[F})=r s r 0 (F). 

Now if E C (J. Ai is a countable cover of a set E, then f[E] C (J. f[Ai] is 
a cover of the image set f[E}. So 

E^o(^) = > r s ? S (f[E}). 

i i 

This holds for all covers of E, so iP* (if) > r s ? s (/[77]) . 

If f[E\ C [J Bi is a countable cover of f[E\, let Ai = / -1 [£?*], so that 
E C (J Ai is a cover of E. Note f[Ai\ C B^. Now 

E (b^ > r s e nviM) = E ^o(^) > y\E). 

i i i 

This holds for all covers of f[E\, so r s fP s (/[if]) > 

Therefore, we have T (E) = r s T (/[E5]). And Dim/[i?] = Dimi?. □ 

Exercise 6.2.12. Suppose /: S' — > T is a function. Let A C S be a Borel set. 
Prove or disprove: (1) If / is Lipschitz, then 

Dim f[A] < Dim A. 

(2) If / is inverse Lipschitz, then 

Dim f[A] > Dim A. 

Proposition 6.2.13. In the metric space R, the one- dimensional packing 
measure T 1 coincides with Lehesgue measure C. 

Proof. First consider a half-open interval, [a, b). If 17 is a finite packing of 
[a, b), write 77 = {(xi, ri), (x 2 , r 2 ), ■ ■ ■ , (x„,r n )} with x\ < x 2 < • • • < x n . 
Then all the balls B ri (x*) are contained in the interval [a — r \ , b + r n \ and are 
disjoint. By the additivity of Lebesgue measure (and the fact that intervals 
are Lebesgue measurable sets), we have 

n 

EM < b - a + r 1 + r n . 

7=1 


If II is (5-fine, then 
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^^( 2 rf) < b — a + 26. 

i=l 

Take the supremum on all 5-fine packings of [a, b) to conclude 

y] ([a, b)) < b — a + 26. 

Let 6 — > 0 to get 

2 o([a,£>)) <b — a. 

On the other hand, given 5 > 0, choose n with (6 — a)/n < 6 , then we can 
pack [a, 6) with n balls all of radius r; = (b — a)/{2n). So CPj([a, &)) > b — a. 
Take the limit to get CPj([a, b )) > b — a. Therefore CP J([a, b)) = b — a. 

Now consider a finite disjoint union of half-open intervals [a, b). If two 
of them are adjacent (the right endpoint of one is the left endpoint of the 
other), then they may be combined into a single interval. If two of them are 
not adjacent, then there is a gap of positive length between them. So by 
Lemma 6.2.6 we have CPj(P) = C( V) for all such finite disjoint unions. This 
holds in particular for the dyadic ring CR defined on p. 150. 

Fix a large N > 0 and consider C and IP for subsets of [-N, AT], We claim 
that CR is a reduced cover class for IP . Given any closed set F C [—AT, AT] and 
any e > 0, there is V € !R with V D F and C(V \ F) < e/2; then there is 
U G y with u 2 v \ F and C(U \(V\ F)) < e/2. Then 

$o(y) < ^o( F ) + vtiv \ F ) < yl( F ) + vtiu) 

<yl(F) +C(U) <Tj(F)+e. 

Now the closed sets form a reduced cover class for IP , so this shows that the 
dyadic ring 3? also forms a reduced cover class for IP in [— N, A/]. 

We have seen that CPj and C agree on so their method I measures also 
agree: C = IP 1 for subsets of \—N, A/ - ]. For a general subset A of K, take the 
increasing limit of [—AT, W] D A. So £ = IP 1 . □ 

The packing dimension is related to the Hausdorff dimension.* 

Proposition 6.2.14. Let S be a metric space and F C S a Borel set. Then 
JG(F) < 2 S T S (F) and dimP < DimF. 

Proof. I first show that TC 4 e (F) < 2 S 1P*(F). Now if CP 1(F) = 00 , then this 
is clear. So suppose C P*(F) < 00 . If there were an infinite packing of F with 
all radii equal to e, then CP 1(F) = 00 . So there is a maximal finite packing 
- ■ ■ ,(x n ,s)} of F. Then CP®(F) > n(2e) s . By the maximally, 

* In the first edition, this was stated only in Euclidean space — one of the drawbacks 
of the diameter-based definition of the packing measure. 
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for any x £ F, there is some i between 1 and n with g(x, Xi) < 2e. So the 
collection { B 2e (xi) : 1 < i < n} covers F, and 

71 

% e (F) < (diam B 2e { Xi )) a < n{ 4e) s = 2 s n(2e) s < 2 s T e {F). 

i=l 

Therefore % e (F) < 2 S T®(F). 

Now take the limit as e — * 0 and conclude 1K S (.F) < 2 s 1Pq(.F). So by the 
Method I theorem, J{ S (F) < 2 S ? S (F). 

Now if s < dimF, then J{ S (F) = oo, so IP S (F) = oo, and therefore s < 
DimT 1 . We therefore conclude that dimF < DimF. □ 

A set F C is a fractal (in the sense of Taylor) iff dim F = Dim F. 


6.3 Examples 

According to Mandelbrot’s definition, a fractal is a set A with Cov A < dim A. 
According to Taylor’s definition, a fractal is a set A with dim A = Dim A. In 
order for these definitions to be useful, we will have to be able to compute the 
dimensions involved. In some cases this is not easy to do. 

In this section, we will do a few examples directly from the definitions. 
We will carry out the calculations in great detail. In Sect. 6.4 we will discuss 
self-similar sets in general. 


Binary Tree 

Here is our first official example of a fractal. We computed ind{0, 1}^ = 0 
in Theorem 3.4.4. For {0, l}(“d we will use the metric g i/ 2 defined on p. 44 
and the measure M 1/2 defined on p. 160. Recall the notation [a] for cylinders 
from p. 13. 

Proposition 6.3.1. Let E = {0,1} be a two-letter alphabet, let be the 
space of all infinite strings using E, and let g\/ 2 be the metric for E ^ . Then 
TC 1 = Mp 2 and dim E^ = 1. 

Proof. To prove that IK 1 = we will use two applications of the Method 

I theorem. 

If a set A C E’l'A has positive diameter, then (Proposition 2.6.7) there 
is a string a £ E^ with A C [a] and diam A = diam [a]. So Mi/2(A) < 

3Vti/2([a]) = diam[a] = diam A. But is the largest outer measure with 
l K e (A) < diam A for all sets A of diameter < e. So M 1/2 < < K e . This is true 
for all e > 0, so M.\/ 2 < “K 1 . 
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On the other hand, let a £ E (*' be a finite string, and e > 0. There is n 
so large that 2~ n < e and n > |ct|, the length of a. Then the basic open set 
[a] is the disjoint union of all sets [/?], where (3 > a and |/3| = n. There are 
2 n- l“l of these sets. Then 

E(M) < E diam ^] = E 2_n = 2 " 1a| - 

/3>a 0>a 

\(3\ = n \(3\=n 


But Mp 2 is the largest outer measure with M]/ 2 ([a]) < 2 _ l“l for all a £ E^*\ 
So E — 3 Vti/ 2 , and thus df 1 < M 1 / 2 - 

Therefore Jf 1 = M 02 . The measurable sets in both cases are given by the 
criterion of Caratheodory, so K 1 = Mi/ 2 . 

Now since 0 < < oo, we conclude that dim = 1. □ 

So we know that {0, is a fractal in the sense of Mandelbrot. It is also 
a fractal in the sense of Taylor: 

Proposition 6.3.2. Let E = {0, 1} be a two-letter alphabet, let E^ be the 
space of all infinite strings using E, and let Q 1/2 be the metric for E^ . Then 
T 1 = 4M 1/2 and Dim E^ = 1. 


Proof. The outer measure M = Mi / 2 is the largest outer measure such that 
M([a]) < 2“I Q I for all a £ £<*>. In fact, M([a\) = 2“H 

We will describe the balls in E^\ Let a £ E^ and let r satisfy 0 < r < 1. 
There is a unique n £ N with 2~ n < r < 2~ n+1 . The prefix a = a\n of length 
n defines a cylinder [a]. I claim that B r (a) = [a]. To see this, note that any 
string r £ [a] agrees with a at least for the first n letters, so g(a, t) < 2~ n < r. 
And any string r (jL [a] disagrees with a somewhere in the first n letters, so 
the longest common prefix is shorter than n, and thus g(er, r) > 2~ n+1 > r. 
The measure of the ball is M(-B r (cr)) = M([a]) = 2~ n , so M(7? r (cr)) < r, 
!M(S r (cr)) > r/2, and r < 2M(73 r .(<r)) . 

(a) First we prove < 4. Let <5 > 0, and let II be a 5-packing of 

E^\ Then the corresponding closed balls { B r (a) : (cr,r) £ II } are disjoint. 
So 


E ( 2r )^ 4 E M(B r (ff))< 4. 


(<x,r)£i7 (cr,r)(Ei7 


This is true for all 5-packings, so J’](E^) < 4. This holds for all 5 > 0, so 
< 4. 

Because E^ is a one-element cover of itself, we have also < 4. 

(b) Now we prove* 7q(E^) > 4. Fix N £ N, N > 2. Write rj — 1 — 2 ~ N , 
so 0 < 77 < 1. Note (1 + 2~ N )r] = 1 — 2~ 2N < 1. Let 5 > 0 be given. We will 
construct a 5-fine packing 77. Choose A7 £ N so that 2~ M <5/2. Define 


* A technical note: packing by balls all the same size is not the most efficient 
packing in this case! 
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N 0 = M,N\ = M + N, - ■ ■ ,N k = M + kN, • • • . 

For k = 0, 1, • • • , let 77*. be the set of all constituents (a, r ) where r = 2~ Nk+1 r] 
and the string er, counting from the beginning, has letter 0 in locations 
No,Ni,--- ,Nk~ i, letter 1 in location TV*, and all Os beyond location TV*. 
Pictorially, 

for n 0 

xx ■ • • xl 000 • • • , 

~M~ 

for III 

xx ■ ■ • xQ xx - ■ ./;1 000 ■ • • , 

M N 

and in general for Ilf, 

xx ■ ■ ■ xQ xx ■ y xO xx ■ y xO ■ ■ ■ xx ■ • ■ xO xx ■ ■ ■ xl 000 ■ • • , 

M N N N ~N 

where there are k blocks of length N. In the the locations x, arbitrary letters 
are allowed. The number of elements in 77* is determined by the number of 
locations where the letter may be freely chosen, so 77* has 2 M-1+fc ( JV-1 ) = 

2 N k -k-i e i emen t s . 

Let 77 = [J fc 77*. We claim that 77 is a packing. Let (<r, r), (<r', r') £ 77. 
We must show g(a,a') > r + r ' . First suppose that (a, r), (cr , ,r') are in the 
same 77*. So r = r 1 = 2 ~ Nk+1 r/. Strings a, a' differ somewhere in the first 
Nk — 1 places, so their longest common prefix has length at most Nk — 2, 
and g(a,a') > 2 _Wfc+2 . On the other hand, r + r' = 2~ Nk+1 r] + 2~ Nk+1 rj = 
2 ~N k + 2 rj < 2 -W+2. gg required. Now suppose (a, r) £ 77*, (■ a',r ') £ 77*/, 
k' > k. Strings a, a' differ in location TV*, so their longest common prefix 
has length at most Nk — 1, so g(cr,a') > 2~ Nk+1 . And r + r' = 2~ Nk+1 rj + 
2 ~ Nk '+ 1 r] < 2~ Nk+1 (l + 2 ~ N )r) < 2~ Nk+1 as required. 

Note that the packing 77 is (5-fine, since for any k, the radius 2~ Nk+1 r] < 
2 ~m+i rj < 2 _m+1 < S. Now compute 

OO 

e ( 2 r) = E E (^) 

(cr,r)Ei7 k— 0 (cr,r*)Gi7fc 

OO OO 

= 2 Nk ~ k ~ 1 2r]2~ Nk+1 =2 7? E 2 ~ k = 4 ^- 

k= 0 k—0 

This holds for all 5 > 0, so Tj(77^)) > 4 rj. Now let TV — > oo so that r/ —> 1, to 
get > 4. 

From (a) and (b), we have = 4 = 4M(E^). 
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(c) Next: if [a\ is a cylinder, then M([ a ]) =4x2 ML The right shift 
a i — ► aa is a similarity with ratio 2~' a \ so this follows from the case already 
proved. Then we have also T 1 ([a]) < 4M([a]). The method I outer measure 
M is the largest outer measure such that M([a]) < 2~' a ' for all cylinders. So 
we conclude M < 4M. 

The clopen sets in are exactly the finite disjoint unions of cylinders 
[a]. Two disjoint cylinders have positive distance. Therefore, by Lemma 6.2.6, 
for any clopen set V, we have MM) = 4M(V). 

(d) Now we claim that the class of clopen sets is a reduced cover class for 
M. Let F be a closed set and e > 0. Now M is a metric outer measure, so by 
Proposition 5.4.3, there is an open set F3F with M([/ \F) < e/4; while V 
is a union of cylinders, so by compactness we may replace it by a finite union. 
Then applying Lemma 6.2.6 we have 

MM) < MM) + MM \ F) < M(M + 4M(P \F)< M(M + e. 

So the collection of clopen sets is a reduced cover class. This means, in the 
application of method I to define T , we may use only covers by clopen sets. 
But we have To(P) = 4M(V) for all clopen sets V, and T is the largest outer 
measure such that M(P) < MM) for all clopen V. So 4M < M. 

Thus we get IP = 4M. Then in particular, 0 < < oo, so 

DimiM) = l. □ 


The Line 

Next is our first official example of a non-fractal. We proved CovR = 1 in 
Theorem 3.2.15. 

Proposition 6.3.3. The Hausdorff dimension of the line R is 1. 

Proof. By Theorem 6.1.4, we have Jf 1 ([0,l]) = £([0,1]) = 1. Therefore 
dim[0,l] = 1. Now [0,1] C R, so dimR > dim[0, 1] = 1. If s > 1, then 
!K S ([0, 1]) = 0. The intervals [n, n + 1] are isometric to [0, 1], so it follows that 
!K s ([n , n + 1]) = 0. Therefore 

OO 

M S (R) < X s {[n,n+1}) =0. 

71 = — OO 

This means that dimR < s. But this is true for any s > 1, so dimR < 1. 
Therefore we have seen that dimR =1. □ 

Exercise 6.3.4. The packing dimension of the line R is 1. 

Lebesgue Measure vs. Hausdorff Measure 

Since the Lebesgue measure was useful in computing dimR, it is easy to guess 
that £ 2 is useful in computing dimR 2 . 
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Proposition 6.3.5. The Hausdorff dimension of two-dimensional Euclidean 
space R 2 is 2. 

Proof. Consider the (half-open) unit square Q = [0, 1) x [0, 1). It is covered by 
n 2 small squares with side 1/n, soife> \/2/n, we have Jt e (Q) < n 2 {\/2/n) 2 = 
2. Therefore Jf 2 (Q) < 2, so dimQ < 2. 

On the other hand, if A is any cover of Q by closed sets, then (since any 
set A of diameter r is contained in a closed square Qa with side < r), 

Y, (diam A) 2 > ^ £ 2 (Qa) 

AeA AeA 

>C 2 I\Jq a 

\aga 

> C 2 (Q) = 1. 

Therefore Jt 2 (Q) > 1, so dim <3 > 2. 

For R 2 , since Q C I 2 , we have dimR 2 > dimQ = 2. If s > 2, then 
3t s ((2) = 0; but R 2 can be covered by a countable collection { Q n : n G N } of 
squares of side 1, so !H S (R 2 ) < ]P n !K s (Q n ) = 0. This shows that dimR 2 < s. 
Therefore dimR 2 <2. □ 

Note that the proof showed 0 < ^K 2 (Q) < oo, where Q is the unit square. 
What is the relation between the two measures C 2 and IK 2 on R 2 ? In fact, 
one of them is just a constant multiple of the other. 

Theorem 6.3.6. There is a positive constant c such that 

K 2 (B)=cC 2 (B) 

for all Borel sets B C R 2 . 

Proof. Let Q = [0, 1) x [0, 1) be the unit square. Let c = Jt 2 (Q). (We have seen 
that 1 < c < 2.) First, if B = rQ = [0,r) x [0,r), then TC 2 (B) = r 2 TC 2 (Q) = 
r 2 c = cC 2 (B). Next, the same is true for a translate of such a square. 

Both measures are metric measures, and these squares are Borel sets. So 
we have TC 2 (V) = cC 2 (V) for any finite disjoint union of squares. In particular, 
this holds for V belonging to the dyadic ring !R (see p. 155). 

Now consider TC 2 and C 2 restricted to subsets of a large square K = 
[— TV, N] x [—TV, TV]. The dyadic ring 1R is a reduced measure class for C 2 on 
K. And TC 2 (V) = c£ 2 (V ) it follows that is also a reduced measure class for 
TC 2 on K. Since TC 2 and C 2 agree on 3?, their method I extensions also agree. 
Thus C 2 and TC 2 agree on K. 

By countable additivity, 9i 2 and C 2 agree on the whole plane R 2 . □ 

Let d be a positive integer. The same method may be used to prove that 
there exists a positive constant Cd such that K d (B) = Cd£ d (B) for all Borel 
sets B C R d . 

Exercise 6.3.7. If B C R d , then dimS < d. If B contains an open ball, then 
dim B = d. 
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Arc Length 

Let /: [0, 1] — > S be a continuous curve in S. The arc length of the curve is 

n 

sup E 

i=l 

where the supremum is over all finite subdivisions 


0 = Xq < X\ < ■ ■ ■ < x n — 1 


of the interval [0, 1]. If the arc length is finite, then we say that the curve is 

rectifiable. 

Theorem 6.3.8. Let f: [0,1] — > S be a continuous curve, let l be its arc 
length, and write C = / [[0, 1]] . 

(a) l > fK 1 (C7),- 

(b) If f is one-to-one, then l = TC 1 (C). 

Proof, (a) Let e > 0. Now / is uniformly continuous (Theorem 2.3.21), so 
there is <5 > 0 such that g(f{x),f(y)) < £ whenever \x — y\ < S. Choose a 
subdivision 

0 = Xq < X\ < • • • < x n = 1 
of [0, 1] with \xi — i| < S for all i. Then the sets 


Ai = f [xi-i,Xi] 


cover C. (But diamA^ may not be g(f(xi-i),f(xi)).) By the compactness of 
[xi-i,Xi], there exist y,; , z-i with Xi-i < yi < z, < Xi such that diamA^ = 
Now we may use the subdivision 


0 < yi < zi < y 2 < z 2 < ■ ■ ■ < y n < z n < 1 


to estimate the length. So 

n n 

«>E^)’/^)) = E diamA, > , K 1 e (C). 

i— 1 i—1 

Now let £ -5- 0 to obtain l > TC 1 (C). 

(b) First, I claim that if 0 < a < b < 1, then Jf 1 (/[[a, 6]] ) > g(f(a),f(b)). 
To see this, consider the function h: f\\a, b\] — > K defined by h(u) = 
e(f(a) ,u). Now h is continuous, and h has values h(a ) = 0 and h{b) = 
g(/(a), /(&)), so by the intermediate value theorem, applied to the contin- 
uous function ho f : [a, 6] — ► R, we know that h also has all values between. 
Now h satisfies the Lipschitz condition \h(u) — h(v)\ < g{u,v), and we have 
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^(/[M]) >K'(h[f[[a,b}]]) 

> ft 1 (M/(a), /(&))]) 

= e(f(a)J(b))- 


This proves the claim. 

Now we apply this inequality. If we have a subdivision 
0 = x 0 < X\ < ■ ■ ■ < x n = 1 

of [0,1], then the set /[[xj_i, Xi)] = /[[xi_i,Xi]] \ {/(xi)} is the difference of 
two compact sets, hence measurable. The sets / [[xj_i, X;)] are disjoint, since 
/ is one-to-one. So 


^e(/(xi-i),/(xi)) < ^^(/[[x.-i.x,)]) 

2=1 2=1 

= ^1J /[[Xj_i,Xj)]^ 

= M 1 (/[[0,1)]) <K\C). 

This is true for all subdivisions, so l < 9~C 1 (C). □ 

Exercise 6.3.9. What is the relation between the surface area (of a surface 
in R 3 ) and its two-dimensional Hausdorff measure? 


Fractal Dimension vs. Topological Dimension 

We will see in the next section that it is possible for the Hausdorff dimension 
dimf to have a non- integer value. But it is not completely unrelated to the 
topological dimension. 

Theorem 6.3.10. Let S be a metric space. Then Co vS < dim S'. 

A complete proof of this result can be found, for example, in [18, Sect. 3.1]. 
(The proof uses Lebesgue integration, which we have avoided in this book.) 
Here, we will prove it for compact spaces: 

Theorem 6.3.11. Let S be a compact metric space. Then Co vS < dimS. 

Proof. Let n = Cov S. This means that Cov S < n — 1 is false. So there exist 
open sets Ui, C/ 2 , ■ ■ ■ , U n+ ± such that (J’Sf Ui = S, but for any closed sets 
Fi C Ui with US 1 Fi = we mus t have F t ^ 0. 
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Define functions on S' as follows: 

di(x) = dist(x, S \ Ui), 1 < i < n + 1 
d(x) = di(x) + d 2 {x) H 1 - d n+ i(x). 

The functions are continuous — in fact, Lipschitz: 

I di(x) - di{y ) | < g(x,y), 

I d{x) - d{y ) | < {n+l)g{x,y). 

Since the sets [7* cover S, we have d(x) > 0 for all x. So since S is compact, 
there exist positive constants a , b such that a < d(x) < b for all x £ S. Now 
define h: S — > K " +1 by 


h(x) 


di(x) d 2 (x) d ra+ i(x) \ 
d(x) ’ d(x) ’ ’ d(x) ) 


The function h is Lipschitz: 


di(x) di(y) 


d(x) d{y) 


< 


\d(x)dj(y) - d{y)dj(x)\ 
d{x)d{y) 

d(x) \di(y) - di(x) \ + di(x) | d(x) - d(y)\ 


d(x)d(y) 


and therefore 


b{n + 2 ) 

< — ^ — e(x,y), 


i uf ^ u , m ^ b(n + l)(n + 2 ) 

|/i(x) - /i(y)| < 5 £>(x,y). 


n+1 


Now I claim that h[S ] includes the simplex 

T = ^ (ti, t 2 , • ■ ■ , t n+ 1) £ K" +1 : ti > 0, L: = 1 j” ■ 

Given (ti,t 2 ,-- ■ ,t n+ 1 ) £ T, consider the sets 

F = { x ■ t . \ 

* \ d(x) - l J' 

Then F* is closed, F* C Ui, and IJ"^ 1 F, = S since di(x)/d(x) = 1. So we 
know by hypothesis that 7 ^ 0 - That is, there exists a point x £ S with 

di(x)/d(x) > ti for all i. But since JT di(x)/d(x) = 1 we have di(x)/d(x) = ti 
for all i. That is, h(x) = (ti, t 2 , • ■ • t n +i). So h\S] D T. 

Now T is isometric to an open set in K n . By Theorem 6.1.7 and Exer- 
cise 6.1.10, we have dim S' > dimF = n. □ 
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Exercise 6.3.12. Let S be a metric space. If Co v S > 1, then dim S' > 1. 


6.4 Self-Similarity 

Self-similarity is one of the easiest ways to produce examples of fractals. This 
section deals with the question of when the similarity dimension can be used to 
compute the Hausdorff dimension. When the two coincide we have a desirable 
situation: the similarity dimension is easy to compute, and the Hausdorff 
dimension is more generally applicable and has many useful properties. 

Let ,r n ) be a contracting ratio list. Let (/i,/2,-“ > fn) be an 

iterated function system of similarities realizing the ratio list in a complete 
metric space S. Let s be the sim-value for the iterated function system. Let 
K is the invariant set for the iterated function system. Of course, K is a mea- 
surable set, since it is compact. Does it follow that dim A" = s? In general, 
the answer is no. There is always an inequality dim A' < s. But simple exam- 
ples show that if there is “too much” overlap among the pieces /,[A], then 
dim .A < s is possible. 


String Models 

Hausdorff and packing measures are often easy to compute for the string 
models we use. Or if not easy to compute exactly, easy to estimate. Often 
estimates are good enough, since to compute the fractal dimensions dim and 
Dim it is enough to know merely whether !H S or !P S is positive or finite. 

When computing !H S or 7 s in our string spaces, we often already have a 
candidate measure JVC. This helps in the computation. 

Lemma 6.4.1. Let E be a finite alphabet, let be the space of all infinite 
strings constructed from E. Let s > 0 and let JVC be a finite metric outer 
measure on E^ . (i) If 7 q([o\) = JVC ([a]) for all a £ E^*\ then CP* = JVC. 
(ii) //CPg([a]) < JVC ([a]) for all a £ E^*\ then 7 < JVC. 

Proof. The balls in are the cylinders [a] . The clopen sets in are the 
finite disjoint unions of cylinders. Write Ik for the class of clopen sets in E^\ 
Note that Ik is an algebra of sets. By Lemma 6.2.6, we have CPg ( W) = JVC(P) 
for all V £ Ik in case (i) and Tq(V’) < JVt(E) in case (ii). 

Let F C E^ be closed, and let £ > 0. Then there is an open set [/DA 
with JVt([/\ A) < e. The open set U is a union of cylinders, so by compactness 
of A there is a finite union V of cylinders with A C V C U. (Alternatively, 
think of this as the fact that E^ u > is zero-dimensional.) We conclude (as in 
the proof of Proposition 6.3.2) that Pg^) — ^o(^) + £ - So Ik is a reduced 
cover class for 7 s . For any set A C E^ we have 
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y S (A) = sup £W): -4SU F n , F n closed > 

f n n ) 

= sup J ^3>S(V„) : Ac\jv n , V n ex\ 

\ n n ) 

= sup < : A C [J V n , V n £ Jl 

f n n 


= DVC(A) 


in case (i) and inequality < in case (ii). □ 

Exercise 6.4.2. Let E be a finite alphabet, let be the space of all infinite 
strings constructed from E. Let s > 0 and let M be a finite metric outer 
measure on E^\ If 5{ S ([a]) = DV[([a]) for all a € E^*\ then df S = M. If 
([a]) < M([a]) for all a £ then 5f s < M. 


The Natural Measure 

Begin with a contracting ratio list (r-i , r -2 ■ ■ ■ ■ ,r n ), with n > 1. Then the 
sim-value s associated with it is the unique positive number s satisfying 

n 

2=1 

Let E be an n-letter alphabet, and let E ^ be the string model. The 
metric g on E is defined so that the right shifts realize the given ratio list. We 
define r(a) recursively, starting with the empty string A, by: 

r(A) = 1, 

r(ae) = r(a ) r e , 

then define g so that diam[o:] = r(a). 

We will also need a measure defined to fit the ratio list. The basis is the 
equation defining the sim-value s: 

n 

2 = 1 

It follows from this that 

n 

( r{a)r z ) s = r{a) s . 

»= l 

That is, the expression r(a) s satisfies the additivity condition for a metric 
outer measure (Theorem 5.5.4). The measure M in question is defined on the 
string space E^\ and satisfies M([a]) = r(a) s for all a. Of course, it is no 
coincidence that s was chosen so that M([a]) = (diam[a]) s . 
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Theorem 6.4.3. Let E have metric g and measure M defined from ratio 
list (r e ) with sim-value s. Then (a) JC S = M, (b) there is a constant c > 0 so 
that = cM, and thus (c) dimE(“) = Dim 75^ ) = s. 

Proof. Write r max = max e r e and r min = min e r e . 

(a) If a set A C £?(“) has positive diameter, then (Proposition 2.6.7) there 
is a string a £ E with A C [a] and diam A = diam[a]. So M(A) < M([a]) = 
(diam[a]) s = (diamA) s . But J~C S is the largest outer measure with J~C e (A) < 
(diam A) s for all sets A of diameter < e. So JVC < 3~C e . This is true for all e > 0, 
so M < J( s . 

On the other hand, let a £ 7J(*' be a finite string, and e > 0. There is n 
so large that r" lax < s, n > |a|, and so r(/3) < e for all /3 £ E^ n \ The cylinder 
[a] is the disjoint union of all sets [/3], where (3 > a and \(3\ = n. Then 

^(H) < ( diam [^]) S = M (^D = M ([ a ])- 

0>a P>cx 

\0\=n \0\=n 

Let s — » 0 to get df S ([a]) < JVt([a]). Therefore TC S < JVC. 

(b) Now consider the packing measure. We will show that c = Iq (£?(“)) 
satisfies the condition. Note that 

5Pg(£ (w) ) > y s (£ (aj) ) > 0 

by part (a) and Proposition 6.2.14. 

We will describe the balls in E^\ Let a £ E^ and 0 < t < 1. Consider 
the ball B t {a). The ratios r(erfn) go to 0 as n — > oo, and r(A) = 1. So 
there is a unique n with r(cr\n) < t < r(cr\(n — 1)). Then as in the proof 
of Proposition 6.3.2 we have B t (a ) = [a] where a = a\n. And M(7? t (cr)) = 
JVt([a]) = r(a) s . Estimate from above: JVC (7? t (cr)) = r(a) s < t s . Estimate 
from below: JVC (B t (a)) = r(a\n) s > (r min r(a\(n - l))) s > (r min /2) s (2 t) s . 

Now let II be a packing of E^°\ The corresponding balls B t (cr), (o', t) £ 77, 
are disjoint. So 

E < 2t >‘ < E s fO-- 

(cr,t)&n min ((7,4)677 min 

This holds for all (5-fine packings, so Tg(E^) < 2 s /r^ nin . This holds for all 
(5 > 0, so < 2 s /r s min < oo. 

Thus c = CPq i s positive and finite. Now if a £ E^*\ then the right 
shift a i — > aa is a similarity with ratio r(a), and it maps E^ onto [a]. 
Therefore lPg([a]) = r(a) s c = cM([a]). And thus, by Lemma 6.4.1(i), we 
have 1P S = cM. □ 

Exercise 6.4.4. Let s be any positive real number. There is a metric space 
S with dim S = s. 
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Exercise 6.4.5. Prove versions of Lemma 6.4.1, Exercise 6.4.2, and Theo- 
rem 6.4.3 for the path spaces defined by a directed multigraph (V, E, i, t). 
Exercise 6.4.6. Compute the value of the constant c in Theorem 6.4.3(b). 

Cantor Dust 

Let us consider the fractal dimension of the triadic Cantor dust (defined on 
p. 2). The ratio list for this set is (1/3, 1/3). The string model is the set E^) 
of infinite strings from the alphabet E = {0, 1}, together with the metric 01/3. 
The two similarities on the model space are the right shifts, say 6 q and 9±, 
defined as follows: 

6> 0 (cr) = 0 a 

9i(a) = 1(7. 

Thus (9q, 61 ) is a realization of the ratio list (1/3, 1/3), with invariant set E^ u \ 

Proposition 6.4.7. The Hausdorff dimension and packing dimension for 
E^ u > with metric g \/ 3 are both log 2/ log 3. 

Proof. For the sim- value, solve 2 (1/3) s = 1 for s to get s = log 2/ log 3. By 
Theorem 6.4.3 we have dim = Dim = s. □ 

Corollary 6.4.8. The Cantor dust has Hausdorff dimension and packing di- 
mension log 2/ log 3. 

Proof. A lipeomorphism preserves the Hausdorff dimension (Exercise 6.1.10) 
and the packing dimension (Exercise 6.2.12). The addressing function h from 
onto the triadic Cantor dust C is a lipeomorphism (Proposition 2.6.3). 

□ 


Sierpinski gasket 

Next we discuss a slightly more difficult example, the Sierpinski gasket 
(see p. 8). 

Let S be the Sierpinski gasket. It is the invariant set for an iterated function 
system with ratio list (1/2, 1/2, 1/2). Let s [ = log 3/ log 2] be the sim- value 
of the ratio list. Let E = { L, U, R} be the appropriate three-letter alphabet. 
Next we describe the natural metric and measure defined on E^ from the 
ratio list. 

Let g be the metric on E ^ for the ratio list (1/2, 1/2, 1/2). That is, g is 
defined so that diam[a] = 2 - l“l for all a £ E^*\ Then the right shifts realize 
the ratio list: 

g(La, Lr) = - b(ct,t), 
g{(Jcr, Ur) = ^ 
g(Ra, Rr) = ^ q(ct,t). 
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The measure M is specified by M([a]) = Each node in E M has 

exactly 3 children, so these numbers satisfy the required additivity (Theo- 
rem 5.5.4). The fact to notice is this: 

M([a]) = (diam[a]) s 

for all a £ E^*\ where s = log 3/ log 2. By Theorem 6.4.3, the Hausdorff 
dimension of the string space E ^ is s = log 3/ log 2. 

The dimension calculation for the string model will be used to help with 
the dimension calculation of the Sierpinski gasket S itself. 

Let h: E^ —> R 2 be the addressing function that sends E ^ onto the 
gasket S. If the iterated function system in R 2 is (f\_, /u, /r), then 

M Lcr ) = f\-(h(cr)), 
h(Ua) = /u(/i(cr)), 
h(Ra) = f R (h(a)). 

Proposition 6.4.9. The Sierpinski gasket has Hausdorff dimension and pack- 
ing dimension at most log 3/ log 2. 

Proof. The addressing function h is Lipschitz (Exercise 4.2.1). By Exer- 
cise 6.1.10, we have dimS 1 < DimS" < DimE^ = log3/log2. □ 

For the general iterated function system, the upper bound is proved in the 
same way. 

Theorem 6.4.10. Let {r e ) e ^E he a contracting ratio list. Let s he its sim- 
value, and let {f e ) e ^E he a realization in a complete metric space S . Let K he 
the invariant set. Then dimE < Dim K < s. 

Proof. The string model E^) with the natural metric g has DimE^) = s 
(Theorem 6.4.3). The addressing function h : E ^ — > K is Lipschitz. Therefore 
DimE < s. □ 

Lower Bound 

The addressing function for the Sierpinski gasket is not inverse Lipschitz. In 
fact, it is not even one-to-one. (This is the answer to Exercise 4.2.2.) So we 
will need a bit more effort to prove the lower bound for the fractal dimension 
of S. Pay attention to the ingredients of the proof, since they will be used 
again for the general case. To simplify the notation, we will write L(j:) in 
place of fi(x), and similarly for the other two letters, and write a(x) for a 
finite string a. 

Proposition 6.4.11. The Sierpinski gasket S has Hausdorff dimension equal 
to the similarity dimension log 3/ log 2. 
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Proof. Let V be the interior of the first triangle So approximating the Sierpin- 
ski gasket S. Then C 2 (V) = VZ/A, and if |a| = \/3\, a / /3, then a[V] n/3[V] = 
0. Also, h [[a]] =a[V]flS. The set Sk approximating S is the union 

U “M- 

Given a set A C S, let k be the positive integer satisfying 
2~ k < diam A < 2~ k+1 . 

Let 

T = { a G E (k) : tfV] n A ± 0 } . 

I claim that T has at most 100 elements. Let m be the number of elements in 
T. A set a\V\ is the image of V under a similarity with ratio 2~ k , so it has 
area 

£ 2 (o[C])=4-‘^. 

The sets a\V] with a £ T are all disjoint. If a; is a point of A, then all of the 
elements of all of the sets a[V\ with a £ T are within distance diam A+2~ k < 
3 • 2~ k of x. So m disjoint sets of area 4 _fc v / 3/4 are contained in the ball with 
center x and radius 3 • 2~ k . Therefore 

m4" fc — < 7 t(3 • 2~ k ) 2 . 

4 

Solving for m, we get m < 36Tr/y/3, which is smaller than 100. 

Next I claim M(/i _1 [A]) < 100 (diam A) s for all Borel sets ACS. Given 

A, let k and T be as above. Then A C IU T a \ v }i so h c LUtM 
Therefore 

a£T 

< 100 x 3 _fc = 100 (2~ k ) s 

< 100 (diam A) s . 

By the Method I theorem, M(A -1 [A]) < 100, 3~C S (A) for all Borel sets A. 
So 1 < 100, Jf s (5), and therefore duns' > s. □ 

Exercise 6.4.12. Improve the estimate 100. 


6.5 The Open Set Condition 

Let (ri, • • • , r n ) be a contracting ratio list with dimension s. Let (/i, • • • , f n ) 
be a corresponding iterated function system of similarities in R d . Suppose K 
is the invariant set for the iterated function system. Write s for the sim- value. 
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In general it is not true that dim K = s. For example, consider the iterated 
function system (/l, /u, /r) for the Sierpinski gasket, realizing the ratio list 
(1/2, 1/2, 1/2). Now the iterated function system(/i_, /l, /u, /r) has the same 
invariant set, of course, but it realizes the longer ratio list (1/2, 1/2, 1/2, 1/2). 
The Hausdorff dimension of the invariant set K is log 3/ log 2, but the sim- 
value of the iterated function system is 2. 

Of course, the problem is that the first two images fi[K] and f\_[K] overlap 
too much. Now we might require that the images do not overlap at all, as in the 
Cantor dust. But that would rule out many of the most interesting examples, 
such as the Sierpinski gasket itself, where the overlap sets like f\_[K ] n fy>[K\ 
are nonempty. 

We do have inequality between the Hausdorff dimension, packing dimen- 
sion, and similarity dimension. If s is the similarity dimension, then the string 
model has packing dimension s and the addressing function is Lipschitz, so 
dim K < Dim K < s. 


Lower Bound 


Now we turn to the “lower bound” proof. That is, we want to show dim K > 
something or Dim K > something. Generally we do this by showing K S (K ) > 
0 or T(K) > 0. 

The iterated function system (/i, ■ ■ ■ , f n ) satisfies Moran’s open set 

condition iff there exists a nonempty open set U for which we have fi[U] fl 
fj[U] = 0 for * ^ j and U 2 fi[U\ for all i. Such an open set U will be called 
a Moran open set for the iterated function system. 

For example, consider the Cantor dust. The similarities are 


fo(x) 

AO) 


x 

3’ 

x T 2 
3 


The open set U = (0, 1) is a Moran open set: the two images are (0, 1/3) and 
(2/3,1), which are disjoint and contained in U. 

Or, consider the Sierpinski gasket (Fig. 1.2.1). The interior* U of the large 
triangle So is a Moran open set. The three images are three small triangles, 
contained in U , and disjoint. 

For a third example, consider the Koch curve (Fig. 1.5.1). The interior of 
the triangle Lq is a Moran open set. 

The fourth example to consider is Heighway’s dragon. This time the open 
set condition is not quite as trivial. The interior U of Heighway’s dragon itself 
will serve. The fact that the two images are contained in U is a consequence of 
the fact that Heighway’s dragon itself is the invariant set of the iterated func- 
tion system. The fact that the two images are disjoint is a consequence of the 

* The interior of a set consists of all the interior points of the set. 
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fact that the approximating polygon never crosses itself (Proposition 1.5.7). 
The verification requires some work, but it is left to the reader: 

Exercise 6.5.1. The interior of Heighway’s dragon is a Moran open set for 
the iterated function system realizing Heighway’s dragon. 

In one case, the open set condition is easily verified: 

Exercise 6.5.2. If the invariant set K for an iterated function system {/,} 
satisfies fi[K\ D fj[K] = 0 for i ^ j, then Moran’s open set condition is 
satisfied. 

The proof of the lower bound will proceed following the same technique as 
Proposition 6.4.11. The area is replaced by the d-dimensional volume, namely 
C d . You may find it instructive to compare this argument with the proof of 
the special case in Proposition 6.4.11. 

Let E be an alphabet with n letters. Write the ratio list as (r e ) e€ E and 
the iterated function system as (f e ) e£ E- To simplify the notation, we will 
write e(x) in place of f e (x), and similarly a(x) for a finite string a. With this 
notation, the model map h: satisfies h(ea) = e(h{a)) for a G £(*) 

and e £ E. 

The open set condition implies that a[U] fl (3[U\ = 0 for two strings 
a,/3 £ E^ unless one is an initial segment of the other. If a is a string with 
length k > 1, we will write a~ for the parent of a ; that is: a~ = ct|"(fc — 1). 

Lemma 6.5.3. Let (r e ) e6 E be a contracting ratio list. Let s be its sim-value, 
and let (f e ) ee E be a realization in R d . Let K be the invariant set. Let U be a 
Moran open set for ( f e ). Then there is a constant c > 0 such that: if A C K, 
then the set 


T = | a € E : a[U] Hi / 0, diama[17] < diamH < diama [J7] j 


has at most c elements. 

Proof. As a ranges over T, the sets a[U] are disjoint, since no such a is an 
initial segment of another. The map f a is a similarity with ratio equal to 
diam[a], so if w is the diameter of U 7 then w diam[a] is the diameter of a\JJ\. 
Write r m ; n = minr e . Then 


diama[?7] = tcdiam[o:] > ror m ; n diam[a ] 

= r min diam a~ [U] > r m i n diamA 

If p = C d (U) is the volume of U, then the volume of a[U] is 
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If x is a point of A, then every point of every set a[U\ for a £ T is 
within distance diam A + diamond] < 2 diam A of x. If m is the number of 
elements of T, then we have m disjoint sets a[U], all with volume at least 
(pr^ lin /w; d )(diam A) d , contained within a ball of radius 2 diam A So if t = 
C d (B i(0)) is the volume of the unit ball, we have 


m P r min 
w d 


(diam A) d < f(2diam T) d . 


Solving for m yields 

tu> d 2 d 


Summary: We may use the constant c = tw d ‘2 d /pr d lin , where r = minr e , t 
is the volume of the unit ball, p is the volume of the Moran open set U, and 
w is the diameter of U. □ 


Theorem 6.5.4. Let (r e ) e ^E be a contracting ratio list. Let s be its sim-value, 
and let (f e ) e ^E be a realization in R d . Let K be the invariant set. If Moran’s 
open set condition is satisfied, then dim If = s. 

Proof. Let c be a constant as in the lemma. I claim there is a positive constant 
b so that for any Borel set A C K, we have 

M {h-^A]) < b (diam A) s . 

Let U be a Moran open set, and let w = diam U. Given A, let T be as 
in the lemma. So A C [_] a ^ T a\U], and h~ 1 [A\ C Uq-gtI 0 ]- If ct £ T, then 
M([a]) = (diam[a]) s = ((l/u>) diama[?7]) s < (l/u> s )(diam A) s . Therefore 

aeT 

< c(l/w; s )(diam A) s . 


So b = c/w s will work. 

Therefore, by the Method I theorem, 1 = M(/i _1 [If]) < b3~C s (K ), so we 
have dim If > s. □ 

The proof given above clearly used the properties of Lebesgue measure in 
R d . What happens in other metric spaces? Readers who know about some 
exotic metric spaces may like to attempt this: 

Exercise 6.5.5. Let 5 be a complete metric space (other than K d ). Let 
(/i, fi, ■ ■ ■ , f n ) be a realization in S' of a contracting ratio list (ri, r%, ■ ■ ■ , r n ) 
with dimension s. Let K be the invariant set. Suppose Moran’s open set con- 
dition is satisfied. Does it follow that dim If = s? 
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Exercise 6.5.6. Let - ,r n ) be a contracting ratio list with dimen- 

sion s. Let (/i,/2,-“ , fn) be an iterated function system consisting not of 
similarities, but of maps fi : K d — > satisfying 

> rig{x,y). 

Suppose the open set condition holds, and suppose there is an invariant set 
K. Does it follow that dim K > s ? 


Heighway Dragon Boundary 

Heighway’s Dragon (p. 20) is a set P in the plane with nonempty interior (a 
space-filling curve) which tiles the plane (p. 74). We have Co vP = dimP = 
DimP = 2, so it is not a fractal in the sense of Mandelbrot. We now have the 
tools to analyze the boundary of P. It turns out that dP is a fractal. So the 
tile P is an example of what we have called a fractile. 

Recall the discussion on p. 74 showing that P tiles the plane. Let us con- 
tinue with this line of reasoning. In Plate 16 we see a black segment from A 
to B that produces a sequence P n of polygons that converge to P, the black 
tile in Plate 17. 

First note that the point A in Pq also belongs to all P„ and is a boundary 
point of P, since point A lies not only in the black tile P, but also in the 
brown, gray, and red tiles. Similarly point B is a boundary point of P. We 
will write dP = U U V as follows. Set U is the portion of the boundary to the 
left of curve AB — that is, the points that belong not only to the black tile, 
but also to at least one of the brown, blue, or yellow tiles. Set V is the portion 
of the boundary to the right of curve AB — that is, the points that belong not 
only to the black tile, but also to at least one of the red, green, or cyan tiles. 
(In fact, the cyan tile never meets the black tile, as we can see by looking at 
Pi in Plate 16.) No other tiles can touch the black tile, because the plane has 
topological dimension 2, so they would have to cross one of the curves shown 
to reach the black tile. 

Consider set U. According to Pl in Plate 16, the “midpoint” C of P is a 
boundary point, since it lies in both the black and blue tiles. The portion of 
U between A and C is a copy of U shrunk by factor 1 / V2. The portion of U 
between B and C is a copy of V shrunk by factor 1 / y2. 

Now consider set V. Look at P 2 in Plate 16. The “three-quarter” point 
D of P is a boundary point, because it lies in both the black and red tiles. 
The portion of V between B and D is a copy of U shrunk by factor 1/2. 
The portion of V between D and A is the boundary between black and red; 
looking at it from the red point of view, we see it is a copy of U shrunk by 
factor 1/2. 

Set V is made up of two copies of U, so dim U = dim V and Dim U = 
DimP. Set U is made up of one copy of U (shrunk by factor 1/^2) and one 
copy of V (shrunk by factor l/y/2). That copy of V is made up of two copies 
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of U each shrunk by a further factor of 1/2. So the complete decomposition 
has U made up of three copies of itself, with ratio list (2 -1 / 2 , 2 -3 / 2 , 2 -3 / 2 ). 

This ratio list has sim- value s « 1.52 given by s = 2 log A/ log 2, where 
A ~ 1.6956 is a solution of A 3 — A 2 — 2 = 0. In order to claim that this value 
s is also the Hausdorff and packing dimensions of U , we need an open set 
condition. Define an open set G as follows: in Plate 16 start with the four 
segments: black, yellow, blue, brown. The curve U lies in the union of the four 
tiles they produce: see the black, yellow, blue, brown in Plate 17. The open 
set G will be the interior of this union. Set G is shown in blue in Plate 18, 
with U shown in yellow. The images of G under the three maps that make 
up the iterated function system for U are shown in the second picture. The 
large red set is an image of G shrunk by factor l/y/2. The cyan and green 
sets are images of G shrunk by factor 2 -3 / 2 . These (open) images are disjoint, 
since the tiles generated from different segments in Plate 16 have disjoint 
interiors. The three images descend from the edges bordering the like-colored 
squares shown in P 3 of Plate 18. This completes our description of the open 
set condition. 


Eisenstein Boundary 

Let us consider next the set F of “fractions” for the Eisenstein number system 
(Fig. 1.6.11). The base is b — —2, and the digit set consists of 0, 1, A = cj, and 
B = lo 2 . The set F may be done in the same way as the twindragon (p. 33). 
But let us proceed in a more direct way. The first set L Q is just the point 0. 
The next set L\ consists of the four points (.0)_2, (-1)— 2 , (.A)_ 2 , and (.£?)_ 2 . 
The set L 2 contains 16 points, all that can be represented with two digits in 
this system. The illustrations in Fig. 6.5.7 show these approximations. The 
set F obtained this time has (of course) similarity dimension 2. 

To see that F is a “fractile” we need to analyze its boundary. The Eisen- 
stein boundary is made up of six congruent parts, as shown in Fig. 6.5.8. The 
individual parts are self-similar. Each consists of three copies of itself, shrunk 
by factor 1/2. See Fig. 6.5.9(a). 

Drawings in Logo can be done. When it is done as a curve, we need to 
take into account that some parts are drawn backward and/or reflected. 


L 


0 





L 


3 


Fig. 6.5.7. Eisenstein fractions 
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Fig. 6.5.8. The Eisenstein boundary consists of six congruent parts 


to EBforward "depth "size "parity 

if : depth = 0 [forward :size stop] [ 
left (60 * : parity) 

EBbackward (:depth - 1) (:size / 2) (-:parity) 
right (120 * : parity) 

EBforward (: depth - 1) (:size / 2) : parity 
left (60 * : parity) 

EBforward (: depth - 1) (:size / 2) (-:parity)] 

end 

to EBbackward "depth "size "parity 
if : depth = 0 [forward :size stop] [ 

EBbackward (:depth - 1) (:size / 2) (-:parity) 
right (60 * : parity) 

EBbackward (: depth - 1) (:size / 2) : parity 
left (120 * :parity) 

EBforward (: depth - 1) (:size / 2) (-: parity) 
right (60 * : parity)] 

end 

The iterated function system and the open set condition for this curve are 
both illustrated by Fig. 6.5.9(b) (c). So we may conclude that the Eisenstein 
boundary has fractal dimension s (both Hausdorff and packing) satisfying 
3 (l/2) s = 1, so s = log 3/ log 2. This is > 1, so this boundary is a fractal. 
That is, F is a fractile. 

Examples for the Reader 

Here is another dragon curve (Fig. 6.5.10, Plate 11), known as the terdragon. 



Fig. 6.5.9. (a) Decomposition (b)(c) Iterated function system and OSC 
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Fig. 6.5.10. Terdragon 


make "shrink 1 / sqrt 3 
to ter : depth :size 

if : depth = 0 [forward :size stop] [ 


right 30 





ter : depth - 
left 120 

1 

: size 

* 

: shrink 

ter : depth - 
right 120 

1 

: size 

* 

: shrink 

ter : depth - 
left 30] 

1 

: size 

* 

: shrink 


end 

This is a space-filling curve. (Six copies of the terdragon exactly fit around 
a point; Plate 12.) It is not a fractal. But what about its boundary? Is it a 
“fractile” ? 

Exercise 6.5.11. Compute the Hausdorff and packing dimensions for the 
boundary of the terdragon. 

A variant of the dragon that constructs the McWorter pentigree (p. 24) is 
shown in Fig. 6.5.12. Five copies of the limit set fit together to form a certain 
“dendrite’.’ This will be called the pentadendrite. (Fig. 6.5.13). 

Exercise 6.5.14. Compute the topological, Hausdorff, and packing dimen- 
sions of the pentadendrite. 

The set of “fractions” for the number system with base — 1 + i and digit 
set {0, 1} is the twindragon (Fig. 1.6.8). 

Exercise 6.5.15. Compute the Hausdorff and packing dimensions of the 
boundary of the twindragon. 
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Pentadendrite construction 



Fig. 6.5.13. Complete pentadendrite 

Exercise 6.5.16. Compute the Hausdorff and packing dimensions for the 
limit of the sets constructed by the following program (p. 116). Warning: 
it is not self-similar. 

to Schmidt : depth :size 
if : depth = 0 [stop] [ 
repeat 3 [ 

forward :size 

Schmidt : depth - 1 :size / 2 
right 120] ] 

end 

Another example of the same kind is the “I” fractal, p. 135. 
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Fig. 6.5.17. Leaf outline; leaf lake 

Barnsley Leaf Outline 

Recall Barnsley’s leaf B (p. 26). Because of the overlap among the parts of 
the iterated function system, the open set condition fails, so the methods of 
this section cannot compute its dimension (as far as I know). Discussion of 
overlap is in Sect. 7.1. But there are some related dimensions that can be done 
now. When all of the surrounded areas of B are filled in, we get a solid leaf 
with fractal boundary. Call that boundary the “leaf outline” J . Or consider 
one of the small regions surrounded by the leaf: call that a “leaf lake” K. 

We will do a “deconstruction” of the leaf in Sect. 7.1 and conclude that 
there is a set H so that B consists of 8 copies of H (and its reflection H') 
arranged as shown in Fig. 6.5.18: four copies of the tile H on the right, and 
four copies of the reflected tile H' on the left. 

Accordingly, the outline J is made up of 8 copies of the “upper left” edge of 
H (when oriented as shown). Tile H is deconstructed as in Fig. 6.5.19(a); the 
portion in the lower right is irrelevant for us now. Thus the segment of the leaf 
outline obeys an iterated function system also represented by 6.5.19(a). The 
picture also provides a Moran open set. The ratio list is (2 _1 , 2 -3 / 2 , 2 -3 / 2 ). 
The fractal dimension is — 2 log r / log 2 « 1.21076, where r « 0.657298 is a 
solution of the cubic r 2 + 2r 3 = 1. 


6.6 Graph Self-Similarity 

Next we consider evaluation of the Hausdorff dimension connected with graph 
self-similar sets. 



Fig. 6.5.18. Deconstruction of Barnsley’s leaf 
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Fig. 6.5.19. (a) Decomposition of H (b) Self-similarity of the segment 


The Two-Part Dust 

We begin with a simple example, the two-part dust. It has been “rigged” 
so that the calculation of the dimension is easier than the general case. The 
Mauldin-Williams graph is as illustrated in Fig. 6.6.1. Here is a description of 
the realization in R 2 that will be considered. The map a has ratio 1/2, fixed 
point (0,0), and rotation 30 degrees counterclockwise. The map b has ratio 
1/4, fixed point (1, 0), and rotation 60 degrees clockwise. The map c has ratio 
1/2, fixed point (0,0), and rotation 90 degrees counterclockwise. The map d 
has ratio 3/4, fixed point (1,0), and rotation 120 degrees clockwise. 

As we know, there is a unique pair of nonempty compact sets U,V CM 2 
satisfying 


U = a\U]U b[V] 

V = c[U] U6[V]. 

This pair of sets is the two-part dust. A sequence of approximations is pic- 
tured in Fig. 6.6.2. They converge in the Hausdorff metric. We may start with 
any two nonempty compact sets Uq and Vo in R 2 . In this case, both have 
been chosen as the line segment from the point (0, 0) to the point (1, 0). Then 
further approximations are defined recursively: 


U n+1 =a[U n ]Ub[V n ], 

V n+1 = c[U n ]Ud[V n ]. 

The sequence ( U n ) converges in the Hausdorff metric to a nonempty compact 
set U, and the sequence {V n ) converges in the Hausdorff metric to a nonempty 
compact set V . This pair of sets is the required invariant list. 

Here is the Logo program for the pictures. 

; two-part dust 
to U : depth :size 



Fig. 6.6.1. Graph for the two-part dust 
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Fig. 6.6.2. Two-part dust 


if : depth = 0 [forward :size stop] [ 
left 30 

U : depth - 1 :size / 2 
penup 

back :size / 2 
right 30 
forward :size 
right 60 
back :size / 4 
pendown 

V : depth - 1 :size / 4 left 60] 

end 

to V : depth :size 

if :depth=0 [forward :size stop] [ 
left 90 

U : depth - 1 :size / 2 
penup 

back :size / 2 
right 90 
forward :size 
right 120 
back :size * 0.75 
pendown 

V : depth - 1 :size * 0.75 
left 120] 

end 

We are interested in computing the Hausdorff (and packing) dimensions 
of the sets U and V. Since each of the sets is similar to a subset of the other, 
their dimensions must be the same. As usual, we begin by computing the 
dimension of the path models corresponding to the Mauldin-Williams graph. 
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We will need to use the Perron numbers of the graph. In this case, 
the Perron numbers (one for each node) are q\j = 1/3 and gv = 2/3. The 
important facts about these numbers are: they are positive and they satisfy 
equations 

9u = r{a)q v + r(b)qy, 

<?v = r{c)q u + r(d)g v . 

By Proposition 4.3.16, the graph has* sim-value 1. I want to show that U 
and V have Hausdorff dimension 1. We will use the Perron numbers to assign 
diameters to the nodes of the path forest. Two ultrametrics g , one on each of 
the two path spaces E^\ E^\ will be defined so that the diameters of the 
basic open sets [a] are as follows: Begin with diam[Tu] = q\j and diam[Tv] = 
gv- If a is a path and e is an edge with t(e) = *(a), then the diameter for the 
string ea is diam([ea]) = r(e) diam([a]). 

Next, we will use the same numbers to define measures. 

There is a metric measure DVC on each of the path spaces E^ such that 
M([a]) = diam([a]) for all finite paths a. The additivity condition (Exer- 
cise 5.5.5) is true by equations (1). Of course we can repeat the steps from 
Lemma 6.4.1 and Exercise 6.4.2 to conclude that the path spaces E\f\ Ey"' 1 
both have Hausdorff and packing dimension 1. (Therefore, we say that U and 
V have [graph] similarity dimension 1.) 

But we are more interested in the Hausdorff and packing dimensions of U 
and V . We will need an “open set condition’.’ A little experimentation with a 
graphics program reveals that this may be satisfied by the two sets pictured 
in Fig. 6.6.3. The two sets are a rectangle and an irregular hexagon. (The 
images of these sets under the maps are appropriately disjoint and contained 
in the appropriate sets.) The dimension is now easy to check. 

Exercise 6.6.4. Let U and V be the two parts of the two part dust. Show 
that the addressing functions 

hu : E^ -> R 2 
hv : E ( y ] -> R 2 

are lipeomorphisms. 



Fig. 6.6.3. Open set condition 


Solution to Exercise 4.3.14. 


6.6 Graph Self-Similarity 


203 


As consequences of this we have dim U = Dim U = dim E 
dimF = DimF = dimF^-* = 1. 


u 


1 and 


Perron Numbers 

To compute the Hausdorff dimension for the other examples with graph self- 
similarity, we need only find the proper sort of “Perron numbers” in those 
cases. (It will not be quite as simple as the rigged example above if the di- 
mension is not 1.) 

Consider a Mauldin-Williams graph (V, E,i,t,r). We will consider only 
the case when the graph is strongly connected (p. 80). This will mean that 
when the invariant set list is found, each of the sets will be similar to a subset 
of each of the others. So they will all have the same fractal dimension. 

We are interested in assigning metrics to the spaces E ^ of strings. (There 
is one space for each tree in the path forest.) The realization consists, as usual, 
of the right shifts. For an edge e £ E uv , the function 9 e defined by 

9 e (a) = Ca- 
rnaps E^ to E^ . The metrics should be chosen in such a way that 6 e is a 
similarity with ratio r(e). We are also interested in defining measures (one for 
each space E^) that will make the computation of the Hausdorff dimension 
easy. 

In order to do this, we need the proper Perron numbers. If s is a positive 
real number, then s-dimensional Perron numbers for the graph are positive 
numbers q v , one for each vertex v £ V, such that 

q s u = E r ^y 

v£V 

e£E u v 


for all u £ V. 

There is exactly one positive number s such that s-dimensional Perron 
numbers exist. This unique number s will be called the sim-value of the 
Mauldin-Williams graph. The existence and uniqueness of the sim-value were 
proved for the case of a 2 node graph in Sect 4.3. For the general graph, the 
proof requires some linear algebra. See Theorem 6.9.5. 

We can proceed even without the proof of this result: if we can find Perron 
numbers, then we will be able to do the computations. When the set V of nodes 
is small, finding Perron numbers can often be done by trial and error. 


Fractal Dimension 

Now that all of the ingredients have been specified, we may proceed to analyze 
the Hausdorff and packing dimensions in this case. Suppose that (V, E, i, t, r ) 
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is a strongly connected, contracting Mauldin- Williams graph. Let s > 0 be 
such that s-dimensional Perron numbers q v exist. We will suppose that the 
graph is strictly contracting so that r(e) < 1 for all e. We will compute the 
dimension for the path model. There is one path space Ey for each node 
v £ V. 

First we need metrics for the path spaces. We want the right shifts to 
realize similarities with the ratios assigned by the Mauldin-Williams graph. 
For each finite path a, let r(a) be the product of the numbers r(e), for all the 
edges e in a. For a £ E^J , we want the diameter of [a] to be r(a)q v . 

Ultrametrics g exist with these diameters. (One for each space E^*\) They 
satisfy 

e(ecr,er) = r(e) g{a,r) 
for a,T £ Ey and e £ E uv . 

Next we want to define measures on the path spaces. (The measures will 
all be called M.) Because of the equations satisfied by the Perron numbers, we 
see that the values (diam[a]) s satisfy the additivity condition (Theorem 5.5.4), 
namely 

(diam[a]) s = ^ (diam[ae]) s . 

i(e)=i(a) 


So there exists a metric measure on each of the spaces E^f 1 satisfying M([a]) = 
(diam[a]) s for all a £ E^*\ 

We can easily find an upper bound for the packing dimension. This is done 
in much the same way as has been done in previous cases. By Lemma 6.4.1, 
there is a positve constant c such that J >s (ui“- ) ) = cM(-E^) = eg*. And by 
Exercise 6.4.2, TC S ( E = M(E^) = g*. But 0 < g* < oo, so dim E^ = 
Dimi?^ = s. 

Once we know the Hausdorff and packing dimensions of the path spaces, 
we can try to apply it to the sets in that we are really interested in. Let 
( fe) e eE be an iterated function system realizing the Mauldin-Williams graph 
(V, E,i,t,r) in M. d . Let ( K v ) ve y be the unique invariant list of nonempty 
compact sets. As usual, we may construct the addressing functions 

h v : E^ ^ R d , 


one for each v £ V, such that 

h u (ea) = f e (h v (a)), 

for a £ Ey*' 1 and e £ E uv . Then h v [Ei u> ' 1 } = K v for v £ V. These 
are Lipschitz maps, so the upper bound for the fractal dimensions is easy: 
dimA'„ < DimiW < Dim E^ = s. 
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For the lower bound, we need to limit the overlap. We will use an open set 
condition. 

Definition 6.7.1. If (/ e ) is a realization of (V, E, i, t, r ) in M d , then we say it 
satisfies the open set condition iff there exist nonempty open sets U v , one 
for each v £ V, with 

Uu D fe[U v } 

for all u, v £ V and e £ E uv ; and 

fe[U v \ nfe'[U v >] = 0 

for all u, v, v' £ V, e £ E uv , e! £ E uv i with e ^ e'. 

Now the argument proceeds as before. Fix a node v £ V. I want to show 
that dim K v > s. As before, if a is a finite (nonempty) string, write a~ for its 
parent. Also, we will use the notation e(x) for / e (x), and similarly for strings: 
a(x). 

First, I claim that there is a constant c > 0 such that: if A C K v , then the 
set 

T = { a £ £« : H A ^ 0, 

diama[t/ t (a)] < diamA < diama _ [t/ t (a-)] | 

has at most c elements. 

Writing 

w m ax = max diam U u , w m i n = mindiam U u , r m i n = minr e , 

u£V uev eeE 

we have for a £ T: 


diam a[17 t (a)] = r{a) diam C/ t («) > w m i n r mit1 r(a ) 


> 

-dr 


w, 


min' min 


^min n . — rrr i ^min^min 

diam a [U t (a-)\ > 


diam A. 


^max ^max 

Now if p = min ug y C d (U u ), we have the volume calculation 

d 


rd, rrr iw /diam a[U t(a) \ 


A(c) 


>p 


/^min^min\ ✓ a\ Q 

1 w *2 ) (diam A) 


Now if i £ A, then every point of every set ct[U t / a ^\ for a £ I 1 is within 
distance diam A + diama[?7 t (a)] < 2 diam A of x. The sets a[U t ( a )] are dis- 
joint, so if T has m elements, then there are m disjoint sets, with volume 
at least p(w m i n r min /w;* 2 ) d (diam A) d inside a ball with radius 2diamA. If 
t = £ d (l?i(0)), we have 


mp ( Wmin? / nin \ (diamA) d < f(2diamA) d . 
V w* / 
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Solving for to, we get 


< t / 2 w* 2 
~ P V^minrmin 


d 


Next, I claim that there is a constant b > 0 so that for any Borel set 
A C K v , we have 

M (h-^A}) < 6(diamA) s . 

Given A, let T be as before. Write q = max ue y q u . Then h~ l [A\ C UaerM' 
If a £ T, then 


M([a]) < r(a) s q s < q s y- 


/diama[17 t ( a )]\ A g s (diamA) 


< 


Therefore 

< ^M([a]) < (diam A) s . 

a£T Wmin 

Then we conclude from the Method I theorem that < bJi s (A) for 

all Borel sets A. In particular, 


H'(K V ) > 


Mih-'IK^) _ m(e; 


H 


~ = f >0 - 

b 


Therefore dimiW > s. And of course Dimif„ > dim K v . 
We have established the result: 


Theorem 6.7.2. Let (V, E,i,t,r) be a strongly connected contracting Maul- 
din- Williams graph describing the graph self- similarity of a list ( K v )vgv of 
nonempty compact sets in R d . Let s > 0 be such that s-dimensional Perron 
numbers exist. Then dimAT„ < T)imK v < s for all v. If, in addition, the 
realization satisfies the open set condition, then dimA'„ = Dim AT,, = s. 

Exercise 6.7.3. Let (V,E,i,t,r) be a strongly connected contracting Maul- 
din-Williams graph. Let ( f e )e£E be a family of maps on satisfying 

Q{fe(x)J e {y)) < r{e)g{x,y). 

Formulate the proper analog of Theorem 6.7.2. 

Exercise 6.7.4. Let (V,E,i,t,r) be a Mauldin-Williams graph. Let ( f e )e£E 
be a family of maps on satisfying g(f e {x),f e (y)) > r(e)g(x, y). Formulate 
the proper analog of Theorem 6.7.2. 

Exercise 6.7.5. Discuss Hausdorff dimension for graph self-similar sets with 
Mauldin-Williams graph not strongly connected. 
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Li Lake 

The graph self-similar set called Li’s Lace is seen on p. 84. Its description is 
on p. 126 shows it is is made up of isosceles right trianglar blocks of two kinds, 
called P and Q. An open space completely surrounded by the fractal is called 
a lake. The boundary of one lake is shown in Fig. 6.7.6(b). This boundary can 
also be described in the language of graph self-similarity. Fig. 6.7.7 identifies 
two boundary parts A and B in tile P. The lake boundary is made up of eight 
parts A. 



Fig. 6.7.6. (a) Two tiles P (b) Lake boundary 



Fig. 6.7.7. (a) Parts A and B (b) Four parts A = half of boundary 
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Exercise 6.7.8. The parts A and B may be described using the descriptions 
of P and Q from Fig. 4.3.3. Set A is made up of two copies of A shrunk by 
factor 1/4 and one copy of B shrunk by factor 1/4. Set B is made up of four 
copies of A shrunk by factor 1/2 and one copy of B shrunk by factor 1/2. 
Compute the corresponding sim- value for sets A and B. Verify the OSC to 
conclude this is the Hausdorff and packing dimension for the lake boundary. 


Pentigree Outline 


Recall the construction on p. 25 of the second form of McWorter’s pentigree. 
The pentigree outline is what we get if we fill in all the “lakes/ and take 
the boundary of the result (Fig. 6.7.9, see also p. XIII). 

Here is the program used to draw Fig. 6.7.9. 


; pentigree outline 

make "shrink (3 + sqrt 5)/2 

to pent : depth :size 

repeat 5 [A : depth :size] 

end 

to A : depth :size 

if : depth = 0 [forward :size right 72 stop] [ 
B : depth - 1 :size / : shrink 

A : depth - 1 :size / : shrink 

A : depth - 1 :size / : shrink 

BR : depth - 1 :size / : shrink] 

end 

to B : depth :size 

if : depth = 0 [forward :size left 36 stop] [ 

C : depth - 1 :size / : shrink 

A : depth - 1 :size / : shrink 

BR : depth - 1 :size / : shrink] 

end 

to BR : depth :size 





Fig. 6.7.9. Pentigree outline 
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if : depth = 0 [forward :size left 36 stop] [ 

B : depth - 1 :size / : shrink 

A : depth - 1 :size / : shrink 

C : depth - 1 :size / : shrink] 

end 

to C : depth :size 

if : depth = 0 [forward :size left 72 stop] [ 

B : depth - 1 :size / : shrink 
BR : depth - 1 :size / : shrink] 

end 

Exercise 6.7.10. Does this really converge to the outline of McWorter’s 
pentigree? 

Exercise 6.7.11. Determine the Mauldin- Williams graph describing the self- 
similarity of curve A. 

Exercise 6.7.12. Compute the Hausdorff dimension of the pentigree outline. 

Number Systems 

Let b be a complex number, |6| > 1, and let D be a finite set of complex num- 
bers, including 0. We are interested in the numbers that can be represented 
in the form 

OO 

J2 a : b ~ J - 

i=i 

In some cases, the set of representations may be restricted to allow only certain 
combinations of digits. Consider 6=3 and D = {0, 1,2}. Let F be the set of 
all numbers x of the form 

OO 

x = J2 a j b ~ 3 > 

3 = i 

where each a,j is in the set D , and such that aj + cij+i < 2 for all j. This set 
is graph self-similar. 

Let F(di) be the set of numbers where the representation has a\ = d\. 
Let F(d\ 1 d, 2 ) be those numbers where the representation has ai = d\ and 
0-2 = d 2 - Then F = F( 0) U E(l) U F( 2). The set E(0) is similar to F, with 
ratio 1/3. The set F(l) = F(1,0) UF(1,1), since F(l,2) = 0. But F(1,0) 
is similar to F with ratio 1/9 and E(l,l) is similar to E(l) with ratio 1/3. 
Finally, F( 2) = F( 2, 0) is similar to F with ratio 1/9. The graph is shown in 
Fig. 6.7.13. 


Exercise 6.7.14. Compute the Hausdorff dimension of the set F. 
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1 

1/9 



Fig. 6.7.13. Graph 


Exercise 6.7.15. Let b = (1 + v / 5)/2 and D = {0,1}. Let F be the set of 
numbers of the form 

OO 

where each a,j £ D , and two consecutive digits 1 are not allowed. Describe the 
set F. 


Topological Dimension 

The addressing function, or model map, which has been developed here for 
the purpose of computing the fractal dimension of a [graph] self-similar set, 
can also sometimes be used for the topological dimension as well. The address- 
ing functions h v : E^' s — > K are continuous and surjective. The spaces E^' s 
are compact. When the overlap is small, the characterization of topological 
dimension of Theorem 3.4.19 is often applicable. Let us do some examples. 

The addressing function for the Cantor dust is one-to-one. Therefore the 
Cantor dust is zero-dimensional. 

The addressing function for the Sierpinski gasket maps at most 2 strings 
to each point. Therefore the Sierpinski gasket has small inductive dimension 
< 1. It contains line segments, so it has dimension exactly 1. 

Exercise 6.7.16. Show that McWorter’s pentigree has small inductive di- 
mension 1. 

If you have not solved Exercise 1.6.5 yet, you can now do so painlessly. 

Theorem 3.4.19 yields only an inequality. The addressing function for the 
Menger sponge (using the construction suggested in Fig. 4.1.9) is 4-to-l at 
some points, so we obtain the uninteresting result that the topological dimen- 
sion is < 3. In fact, the topological dimension is 1. Is there another way to 
produce the Menger sponge as a self-similar set such that the model map is 
only at most 2-to-one? 


6.8 *Other Fractal Dimensions 

According to Mandelbrot, a fractal is a set S with dim S' > CovS. We will 
consider (for the moment) only nonempty compact sets in Euclidean space. 


Optional section. 


6.8 *Other Fractal Dimensions 
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Mandelbrot expressed dissatisfaction with the definition for two reasons: (1) 
“borderline fractals” are excluded; and (2) “true geometric chaos” is included. 


Borderline Fractal 

What might be meant by a “borderline fractal”? This will be a set K with 
the usual features of fractals that we have seen often, but where dim if = 
Co v K anyway. To illustrate this, we will consider a curve, with a dragon- 
like construction. We begin with a sequence of positive numbers Wk, with 
wo = 1, Wk > Wk+i > Wk/2, and Hindoo Wk = 0. The first set is a line 
segment Pq with length wq = 1. If the polygon P has been constructed, 
consisting of many line segments of length Wk, then to construct Pk+i, we 
replace each of those line segments by two segments of length Wk+i- (It is 
possible to do this, and still have a polygon, since Wk+i > Wk/ 2.) If xcfe > 0 
fast enough, we can avoid having the curve cross itself (even in the limit) by 
alternating between sides of the curve, as shown in the illustration. Then the 
limit will be homeomorphic to an interval, and therefore have small inductive 
dimension 1. 

If we choose Wk that satisfy w%. = l/2 fc , for some s > 1, then this is a self- 
similar dragon curve that we have seen before. In the binary tree, if a is a finite 
string of length fc, then when we use the metric and measure appropriate for 
the tree, we have diam[a] = Wk and 3Vt([a]) = l/2 fc = (diam[a]) s . The usual 
calculation shows that the Hausdorff dimension for the curve will be s. 

But suppose we have Wk that satisfy 

w k _ l_ 
log(l /w k ) 2 k 

This means that Wk goes to zero more rapidly than (1 /2 fe ) 1 / s for any s > 1, 
but more slowly than l/2 k itself. 

Exercise 6.8.2. When 



Fig. 6.8.1. Borderline dragon 
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Wk _ 

log(l /w k ) 2 fe ’ 

the topological, Hausdorff, and packing dimensions are all 1. 
Why would we call this a “borderline fractal”? 


Box Dimensions 


We will next discuss some other fractal dimensions known as box dimensions. 
We will begin with R 2 for simplicity. But first consider a variant of the Haus- 
dorff dimension. 

If r > 0, then the square net of side r consists of all squares of the form 
A = [(to — l)r, mr) x [(n — l)r, nr) , 

where to, ro G Z. Write § r for this set of squares. So the plane R 2 is the disjoint 
union of this countable collection of squares. Write § = Ur>o &r- 

For s > 0, consider the method II outer measure M S defined using the set 
function c: § — > [0,oo) defined by: c(A) = r s if A € § r . Now any set BCR 2 
of diameter r is contained in the union of at most 4 sets of S r . On the other 
hand, a square with side r has diameter y/2r. This means that 

2~ s / 2 X s (F) < 3vf (F) < 4 5f s (F). 

Therefore sq = dim F is the critical value for which 

M (F) = oo for all s < sol 

M S (F) = 0 for all s > so- 

Some calculations involving the Hausdorff dimension are simplified by using 
this alternative to the definition (for example [23, Chapt. 5]). 

Now we discuss another variant. Fix a number r > 0, and cover only by 
sets of S r ; then let r — > 0. It should be emphasized that this is not method 
II. Now if A C S r covers a set F, then Xm<e.a rS J us ^ Nr s , where N is the 
number of elements of A. So the definition may be phrased as follows. Let 
N r (F) be the number of sets of the square net S r that intersect F. Define 

K(F ) = N r (F) r s , 

X s {F) = liminf X* r {F). 

r— ► 0 

As usual there is a critical value for s. 


Exercise 6.8.3. Let 


s o = lim inf 


log N r (F) 
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Then 


3C S (F) = oo for all s < s 0 i 

X S (F) = 0 for all s > Sq. 

The critical value sq will be called the lower box dimension or lower 
box-counting dimension or lower entropy index of F. We will write 
dim B F. The set functions X s have the same shortcoming as CP S : 

Exercise 6.8.4. There is a countable compact set K with positive lower box 
dimension. 

Because of this undesirable property, we can apply Method I to X s to 
get a metric outer measure. Or we can modify the dimension directly. The 
modified lower box dimension is 


dim MB F = inf sup dim B Fi 

i 


where the infimum is over all countable covers F C lJ igN F t of F. 

A variant is obtained by replacing liminf with limsup. The set functions 
are then 

limsup 3C®(F). 

r — >0 

The critical value for s, called the the upper box dimension,* is given by 


dim B F = lim sup 

r — »0 


logJV r (E) 

log(l/r) 


Again the set function is not an outer measure, and dim B is not countably 
stable, so define the modified upper box dimension by 

dimMB F = inf sup dim B Fi 

i 

where the infimum is over all countable covers F C [J igN F °f 
Exercise 6.8.5. Let F C R 2 . Then 


dim F < dini p F < dim B F < packing index of F, 
dim F < dim MB F < dimMB F < Dim F. 

So if F is a fractal in the sense of Taylor, these four fractal dimensions all 
coincide. (In fact, we will see below that dimMB F = DimF.) 

Exercise 6.8.6. Define the set functions X s and 3C S in the space R d . Prove 
analogs of Exercise 6.8.5. 

* Barnsley [3] uses the term “fractal dimension” for this value. 
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Boxes do not make sense in a general metric space, but a box dimension 
can be defined there anyway. Let S be a metric space, let F C S, and let r > 0. 
Define 7V r (F) to be the maximum number of disjoint closed balls with radius 
r and center in F. (Actually, instead of “disjoint” we can take the sense used 
for the packing measure: the centers Xi satisfy g{xi,xf) > 2 r for all i ^ j.) 

Proposition 6.8.7. Let FCI 2 . Then 


lim inf 

r — >0 


lim sup 

r — ^0 


log7V r (F) 
log(l/r) 
log N r (F) 
log(l/r) 


= lim inf 

r—*Q 


= lim sup 

r— ► 0 


log Nr{F) 
log(l/r) 

log N r {F) 

log(l/r) 


Proof. Let 5 > 0. There are Ns{F) disjoint balls with radius <5 and center in 
F. No two of those centers have distance < 25, so no two of those centers are 
in the same square of § 5% / 2 - So ^s(F) < N S ^(F). 

Let r > 0. In each square of S r that meets F, choose one point of F, and 
use it as the center of a closed ball of radius r/4. Any such ball intersects at 
most 4 squares of S r , and N r (F) < 4 N r /±{E). 

Therefore, for any r > 0 we have 

log N r (F) < log(4 N r/i (F)) _ log N r /j(F) + log 4 
log(l/r) _ log((l/4)(4/r)) log(4/V) - log 4 

log N r (F) > log K/V 2 (F) _ log N r/V z{F) 

log(l/r) _ log((l/^)( v / 2 /r)) \og(\/2/r) - log a/2 

But as r — > 0, we have log(l/r) -> oo, so we get the lim sup and lim inf results 
claimed. □ 


In a metric space S, define the upper and lower box dimension using N 
in place of N. Then define the upper and lower modified box dimensions as 
before. It doesn’t have “boxes” in the defintion any more. Sometimes dime 
or dini p may be called Bouligand dimension or Minkowski dimension 
instead. 


Proposition 6.8.8. Let S be a metric space and F C S. Then diniMB F = 
DimF. 

Proof. First we claim: DimF < dime F. Let t < s < DimF. Then T S (F) = 
oo, so Jo(F) = oo and 1P|(F) = oo for all 5 > 0. Now let 5 be given with 
0 < 5 < 1. Then there is a (5-fine packing 77 of F with ]C( x ,r)en rS > -*-■ ^ or 
k € N, let rife be the number of (x,r) € 77 with 2~ k ~ 1 < r < 2~ k . Then there 
is some k with nj, > 2 kt (l — 2 t ~ 2 ), since if not we would have 

OO OO 

1< J2 r s <J2 n ^~ ks <(l--^~ s )J2 2k{t ~ S)=1 - 

(x,r)an fc=o fc= o 
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For that k, we get rik disjoint balls centered in F with radius 2 k 1 , so 

N 2 -k-i(F) > rik > 2 kt (l — 2 t_2 ), 
log N 2 - k - 1 (F) kt log 2 + log(l - 2 t ~ s ) 
log(2 fe+1 ) “ (A: + 1) log 2 

Now the right-hand side goes to t as k — > oo, and for any <5 there is k as given 
with 2~ k < d , so we conclude that dime F > t. But t was any value < DimF, 
so in fact dime F > Dim F. 

Next we claim Dim F < dimMB F. Let F C Fj be a countable cover of 
F. Then 

DimF < sup Dim Fj < sup dims Fj. 

i i 

Take the infimum over all such covers to get Dim F < dimMB F . 

And finally we claim dimMB F < DimF. Let s > DimF. Then CP S (F) = 0, 
so there is a countable cover F C (J Fi with Yli ^o(Fi) < oo for all i. Fix an 
i. There is <5 > 0 with T|(Fj) < oo. Then since Ns(Fi)S s < CP|(Fj), it remains 
bounded as <5 — > 0, so dime Fj < s. This is true for all i, so sup, ; dime Fj < s. 
Take the infimum on all covers to get dimMB F < s. This is true for all 
s > Dim F, so dimMB F < Dim F. □ 

Lipschitz Condition of Order p 

Let S,T be metric spaces, and let p > 0. We say that a function f:S^T 
satisfies a Lipschitz condition of order p if there is a constant M so that 
for all x,y £ S, 

g(f(x)J(y))<Mg(x,y) p . (1) 

We may say / € Lip(p). This is also called a Holder condition of order p. 

Proposition 6.8.9. Let f:S—>T,f£ Lip(»). Then (a) p dim /LSI < dimF 
and (b) p Dim f[S] < Dim S'. 

Proof. Let M satisfy (1). (a) If A C 5, then diarn / [A] < M (diamA) p . Let 
6 > 0, s > 0, and let { A n } be a cover of S by sets of diameter < S. Then 
{/[A„]} is a cover of f[S] by sets of diameter < MS P . So 

(/[«» < E (diam/[A„]) s < M s E ( diam A„) ps . 

nGN nGN 

Taking the infimum over all (5-covers, we get 

X s MSP (f[S]) <M s X P s s (S). 

Taking the limit as 6 — > 0, we get 

K s (f[S]) < M s M ps (S). 
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If s > (1/p) dim 5, then ‘K ps (S) = 0 so E. S (f[S]) = 0, which means s > 
dim f[S}. Therefore dim/ [5] < (1/p) dim S as required. 

(b) Let 5 > 0. Define S 1 = M5 P . Then S' — > 0 as 6 — > 0. Also, log (2 /S') = 
plog(2/<5) + G for a certain constant G. Now let E C S. If u±, ■ ■ ■ , un £ f[E\ 
have g(ui,Uj ) > 6' for all i ^ j, then there exist a \ £ E with /(a:,) = Ui and 
g(xi, Xj ) > 5 for all i ^ j. Therefore 


N s/2 (E)>N s , /2 (f[E}), 


and thus dime E > pdirng f[E\. 

Let S’ = (J i Ei be a countable cover. Then 

sup dim B Ei > psupdim B f[E,,] > pdim M B /[S]. 

i i 


Take the infimum over all covers to get dimMB S > pdimMB /[S]. And by 
Prop. 6.8.8, diniMB = Dim. □ 

Theorem 6.8.10. Let 0 < p < 1. Suppose f: [a, b] — > M satisfies a Lipschitz 
condition of order p. Then the graph 

G= { (x,f(x)) : x £ [a, 6] } 


satisfies dim G < 2 — p. 


Proof. We may assume that [a, b] = [0,1]. Divide [0, 1] into n sub-intervals of 
length l/n. On each of these intervals / can vary by no more than M(l/n) p . 
Thus, the part of the graph over one of the sub-intervals can be covered by 
no more than Mn 1_p + 1 squares of side l/n. Thus 


X s V 5/ n (G)<n(Mn 1 - p + 1 ) 



M2 s/2 n 2 - p ~ s + 2 s/2 n 1 ~ s . 


If s = 2 — p, then this shows 3~C S (G) < M2 ( 2 p )/ 2 + 2^ 2 p )/ 2 , so dimG < s = 

2-p. □ 

Besicovitch and Ursell [5, part V] gave examples of functions satisfying a 
Lipschitz condition of order p (and no better) where dimG = 2 — p and other 
examples where dim G < 2 — p. 


6.9 ^Remarks 

Felix Hausdorff [32] formulated the concepts that we call today the Hausdorff 
measures and the Hausdorff dimension. Almost all of the early work on the 
subject was done by A. S. Besicovitch [5]. Mandelbrot therefore uses the term 
“Hausdorff Besicovitch dimension’.’ 
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In fact, Hausdorff proposed a much more general class of measures than 
the ones discussed here. For example, he proposed using functions of the di- 
ameter other than a power: for example, the function h(x) = x s (l/log(l/x)) 
corresponds to the construction in 6.8.2 when s = t = 1. He also proposed 
using characteristics of the covering sets other than the diameter. The seminal 
paper [32] is required reading for the aspiring expert on fractals. Computation 
of the Hausdorff dimension using self-similarity appears even in Hausdorff ’s 
paper. It was carefully worked out by P. A. P. Moran [51] for subsets of R, 
and by John Hutchinson [36] for subsets of K d . The open set condition is used 
in both of these papers. 

The similarity dimension agrees with the Hausdorff and packing dimen- 
sions also for iterated function systems in (complete separable) metric spaces 
other than Euclidean space. But the open set condition must — in general 
[59] be replaced by a strong open set condition. In addition to the prop- 
erties listed on p. 149, the closure U of the open set U must intersect the 
attractor K. 

The packing measure was introduced by Claude Tricot (but see [34, Exer- 
cise (10.51), p. 145]), and advocated by Taylor & Tricot [61], Saint Raymond 
& Tricot [60], and Taylor [62]. Fractal dimensions in addition to those defined 
here can be found in [44, p. 357 ff] and [45]. 

The generalization of self-similarity that we have called “graph self- 
similarity” has a complicated history. The version that is used here is based 
on the work of R. Daniel Mauldin and S. C. Williams [48]. The “two-part 
dust” was invented explicitly to illustrate the computation of the Hausdorff 
dimension for graph self-similar sets. 

The pentadendrite was shown to me by my colleague W. A. McWorter. 
The terdragon comes from Chandler Davis and Donald Knuth [13]. 

Topological vs. Hausdorff Dimension 

In Theorem 6.3.11, Co vS < dim S was proved only for some spaces S, such 
as compact spaces. In fact, it is true for any metric space S. A complete proof 
is in [18, Sect. 3.1]. As noted, that proof used Lebesgue integration. Recently, 
a proof without integration was published by M. G. Charalambous [9]. The 
covering dimension of a space S' is a topological property of the space. That 
is, if S is homeomorphic to T, then Cov S = Cov T. The Hausdorff dimension 
is not a topological property. The spaces {E^\ g r ) 7 where E = {0, 1} is a two- 
letter alphabet, are all homeomorphic, but the Hausdorff dimension varies as r 
varies. We know that Cov S < dim S. In fact, Cov S is the largest topologically 
invariant lower bound for dimS: 

Theorem 6.9.1. Let S be a separable metric space. Then 

Cov S = inf { dim T : T is homeomorphic to S } . 

I will prove here only the simplest case: 
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Proposition 6.9.2. Let S be a separable zero- dimensional metric space. Then 

0 = inf { dim T : T is homeomorphic to S} . 

Proof. First, S is homeomorphic to a subset T of the string space {0, l}^, by 
Theorem 3.4.4. With metric g r , the space {0,1}^ has Hausdorff dimension 
log2/log(l/r). But lim r ^ol 0 g2/log(l/r) = 0. □ 

The general case may be proved in a similar way [35, Theorem VII 5]. 
For example, the Menger sponge is a universal 1-dimensional space, so met- 
ric spaces homeomorphic to the Menger sponge, but with Hausdorff dimen- 
sion close to 1 should be exhibited. The approximation shown in Fig. 6.9.3 
suggests the idea. It is self-affine, rather than self-similar, so our meth- 
ods of computation will not evaluate its Hausdorff dimension, however. 


Two-Dimensional Lebesgue Measure Compared 
to Two-Dimensional Hausdorff Measure 

According to Theorem 6.3.6 there is a positive constant c such that IK 2 (B) = 
cC 2 (B). We will show here that c = 4/7T. 

For the lower bound on I H 2 , we need an interesting fact from two- 
dimensional geometry: Among all sets with a given diameter, the disk has 
the largest area. That is, if A is a set with diameter t, then C (A) < 7rt 2 /4.* 
The proof requires some knowledge concerning convexity in two dimensions. 
First, A has the same diameter as its convex hull, so we may assume that A 
is convex. Similarly we may assume that A is closed. Choose any boundary 
point of A; let it be the origin of coordinates. A has a support line there, 



Fig. 6.9.3. Homeomorph of the Menger Sponge 


* The corresponding fact for higher dimensions can be proved from Steiner’s sym- 
metrization construction. See, for example, [21, p. 107]. 
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Fig. 6.9.4. Polar Coordinates 


let it be the x-axis. Then the set A is given in polar coordinates (r, 9) by 
equations 

0 < r < R{6 ), 0 < 9 < 7T, 

for some function R. Now A has diameter t. So the distance between the polar 
points (R{9),9) and ( R{9 + tt/ 2),9 + n/2) is at most t (Fig. 6.9.4). By the 
Pythagorean theorem, we may conclude 

R(9) 2 + R(9 + n/2) 2 < t 2 . 


Then the area may be computed in polar coordinates: 



< 



m 2 

2 

R{9) 2 


d 9 


R{9) 


/ tt/2 


R(9 + n/ 2) 5 


t 


nt 2 


— c\9 = — . 
2 4 


2 

-d 9 

d 9 


So: a set ACM 2 with diameter t has area at most ■nt 2 /A. Then the 
argument given in Theorem 6.3.6, with a = 4/-7T, will show that (4/7r)£ 2 (i?) < 
3~C 2 (B) for any Borel set B. 

For the upper bound, we use the Vitali covering theorem [23, Theo- 
rem 1.10] or [18, Theorem 1.3.7]. Let b = d£ 2 (Q), where Q is the open unit 
square. Now a disk can be approximated inside and outside by little squares, 
so we have (by the argument in the proof of Theorem 6.3.6) df 2 (B) = bC 2 (B) 
for all disks B. The collection of all closed disks with diameter < e and con- 
tained in the square Q satisfies the hypothesis of the Vitali theorem, so there 
is a countable disjoint set { -B* : i e N} of them with C 2 (Q \ UigN -®i) = 0- 
But then, by the inequality 9~C 2 (B) < bC 2 (B), we know that !K 2 (Q \ UigN -®i) 
is also 0, so !K 2 (Q \ U;gN^) = 0. Now 
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TC 2 < y^(diami^) 2 



iEN / i— 1 


oo 


A CXJ 

= Bi ) 2 

7 T z ' 4 



Therefore b = M 2 (Q) = Jf 2 (U ieN ^) < 4 /tt. 

So we have exactly TC 2 (S) = (4/7 t)C 2 (B) for all Borel sets B. 

The same result is true in R d , namely K d (B ) = CdC d (B), where Cd is the 
appropriate constant 1/ £, d (B 1 /' 2 (A)). 

The Sim-Value of a Mauldin Williams Graph 

The sim-value of a Mauldin-Williams graph exists and is unique. The proof 
of this fact will be given here. It requires some knowledge of linear algebra. In 
particular, it requires information from the Perron-Frobenius theorem (stated 
below) . 

Let A be a square matrix. The spectral radius of A is the maximum 
of the absolute values of all of the complex eigenvalues of A. We will write 
sprad A for the spectral radius of A. 

We will use some additional notation: A > 0 means all of the entries of 
A are nonnegative, and A > 0 means all of the entries of A are positive; 
A > B means A — B > 0, and A > B means A — B > 0. The matrix A > 0 
is called reducible iff the rows and columns can be permuted (by the same 
permutation) so that A has the form 


B O 


where B and D are square matrices (with at least one row each), and O is a 
rectangular matrix of zeros. If A is not reducible, then it is irreducible. A 
column matrix with all entries 0 is 0 , and a column matrix with all entries 1 

is 1. 

Here is (part of) the Perron-Frobenius theorem. See [27, Chap. XIII] for a 
proof. 

Theorem 6.9.5. Let A > 0 be an irreducible square matrix, and let X £ R. 


Then: 


(1) If X = sprad A, then there is a column matrix x > 0 with Ax = Ax. 

(2) If there is a nonzero column matrix x > 0 with Ax = Ax, then X = 


sprad A. 
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(3) If there is a nonzero column matrix x > 0 with Ax < Ax, then A > 
sprad A. 

(4) If there is a nonzero column matrix x > 0 with Ax > Ax, then A < 
sprad A. 

Now we are in a position to prove that the dimension of a strongly con- 
nected, strictly contracting Mauldin-Williams graph exists and is unique. 

Theorem 6.9.6. Let (V, E, i, t, r) be a strongly connected, strictly contracting 
Mauldin-Williams graph. There is a unique number s > 0 such that positive 
numbers q v exist satisfying 


du = E r(e) s 

vev 

e£E uv 


for all u £ V. 

Proof. We will be using matrices with the rows (and columns) labeled by V. 
For each pair u, v £ V, and s > 0, let 


A uv {s) = ^2 r(e) s . 

e£E U v 

Let A(s) be the matrix with entry T u „(s) in row u column v. Let <£(s) = 
spradAl(s) be the spectral radius of the matrix ^4(s). Now the matrix A(s) 
has nonnegative entries. The entry A u „(s) is positive if and only if E uv is 
not empty. Since the graph is strongly connected, the matrices Al(s) are ir- 
reducible. I will prove: (a) s-dimensional Perron numbers exist if and only if 
<P(s) = 1, and (b) the equation <P(s) = 1 has a unique solution in [0,oo). 
First, suppose that s-dimensional Perron numbers exist, so that 

q s u = r ( e ) s q v 

vev 

e£E uv 

for all u £ V. Thus, if the column matrix x has entries <?*, then x > 0 and 
Al(s)x = x, so by the Perron-Frobenius theorem, 1 is the spectral radius of 
A(s). 

Conversely, suppose that 1 = spradAl(s). Then by the Perron-Frobenius 
theorem, there is a column matrix x > 0 with Al(s)x = x. If we write x v for 

1/s 

the entries of x, then the numbers q v = x v will be s-dimensional Perron 
numbers. 

Next, I claim that the function <I> is continuous. Certainly the entries 
A U v(s) of the matrix are continuous functions of s. Fix a number s o- Let 
x > 0 be the Perron-Frobenius eigenvector: ^4(so)x = <P(so)x. Let the entries 
of x be i„. Define positive numbers a, b by a = miner,,, b = maxx v . Suppose 
V has n elements, so the matrices are n x n. Let e > 0 be given. By the 
continuity of the entries A uv , there exists S > 0 so that if | s — so | < 5, then 
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|A u 1 J (s) A u 1 J (so)| ^ 


ae 

nb 


for all u , v. Now we have 

E A uv(s)x v = Y, -A-uv (s 0 )x v + E(^ uv(s') A-uv (so))x v 

V V V 

CLS 

< &{s 0 )x u + n — b < (£>(s 0 ) + s)x u - 

Therefore, by the Perron-Frobenius theorem, <P(s) = sp rad A(s) < ^(so) + e. 
Similarly <£(s) > <P(sq) — e. This shows that (fr is continuous. 

Since the graph is strongly connected, each row has at least one nonzero 
entry. So for each u there is v with A„„(0) > 1. Therefore A(0)1 > 1, so that 
*(0) > 1- The entries A uv (s ) — > 0 as s — > oo, so for large enough s, we have 
A uv (s) < 1/(2 n) for all u,v, so that A(s)l < (1/2)1, and thus ^(s) < 1/2. 
Now by the intermediate value theorem, there is a solution s to the equation 
<2>(s) = 1. 

Finally, to prove the uniqueness, we will show that <P is strictly decreasing. 
The derivative of A uv is <0, and in fact < 0 unless A uv is identically 0. Each 
row has at least one nonzero entry, so if s > so and x is the Perron-Frobenius 
eigenvector for A(s 0 ), we have A(s)x < A(so)x = ^(s 0 )x. So <P(s) < <P(so). 
Therefore the function <P is strictly decreasing. □ 

Exercise 6.9.7. Let (V) E,i,t,r) be a contracting, strongly connected Maul- 
din-Williams graph. Are the conclusions of Theorem 6.9.6 still correct? 

To compute the dimension of a strictly contracting, strongly connected Maul- 
din— Williams graph, we would ordinarily find the numbers s such that 1 is an 
eigenvalue of the matrix A(s). If that s is unique, it is the dimension. If not, 
then we find for each s the corresponding eigenvector for A(s); only one of 
the values s will admit an eigenvector with all entries positive. 


Remarks on the Exercises 

Exercise 6.1.10: f: S — > T is Lipschitz, and ACS. Say Q(f{x),f(y)) < 
bg(x, y). If T> is a countable cover of A by sets with diameter at most e, then 
D' = { f[D\ : Be®} is a countable cover of f[A ] by sets with diameter at 
most be. Now 

(diam/[D]) s < b s ^ (diamU) s , 

DeD DgD 

so 9Lj e (/[A]) < & S 9L*(A). Therefore dim /[A] < dim A. The case of inverse 
Lipschitz is similar. 

Exercise 6.3.12: Suppose Cov S > 1. Then S does not have a base for the 
open sets consisting of clopen sets. So there is a point a £ S and e > 0 such 
that for 0 < r < e, the ball B r (a ) is not clopen. The function h: S — > M 
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Fig. 6.9.8. All edges have value (3 — %/5)/2 


defined by h(x) = g(x,a ) satisfies | h(x) — h(y) \ < g(x,y). Its range includes 
the interval (0,£). So we have dim S' > dim h[S] > dim(0,e) = 1. 

For Exercise 6.5.11: The terdragon boundary is made up of two copies 
of the 120-degree dragon of Fig. 1.5.8. The open set condition (Plate 13) is 
satisfied by an open set the shape of the filled-in fudgeflake (Fig. 1.5.8); it may 
be thought of as the union of three terdragons. Exercise 6.7.11. Fig. 6.9.8. 

Exercise 6.7.12. 1.22. 

Exercise 6.7.5. [48]. 

Exercise 6.7.14. The graph of Fig. 4.3.13. This is a special case of the 
situation considered in [17]. 


The fractile lines of the sandstone. 
— Scribner's Magazine, April, 1893 
(quoted in the Oxford English Dictionary ) 
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This chapter includes additional examples of fractals, and hints at parts of the 
subject that we have not covered. In Sections 7.3 and 7.4, the computation 
of the fractal dimension requires more than just a simple application of the 
results of Chap. 6. So these examples show that there is more to the subject 
than we have seen in this book. 


7.1 *Deconstruction 

Let K be the attractor for an iterated function system (f e ) of similarities in 
M d , and let the ratio list have sim- value s. Then dimil < Dim K < s (The- 
orem 6.4.10). If the parts f e [K] are disjoint, or have small overlap (specified 
by the open set condition), then dim A' = Dim AT = s (Theorem 6.5.4). But if 
there is too much overlap, then the dimension could be strictly smaller than 
s. Sometimes the fractal dimension can be still computed by “deconstructing” 
the attractor —interpreting it in a manner different from the one provided by 
the iterated function system. 

Recall the Li’s lace fractal on p. 84. It is made up of blocks of two kinds, 
P, Q , as described on p. 126. From the decompostions of the two sets, we get 
P I) Q. Note that Fig. 4.3.3 not only describes the iterated function system, 
but also shows that the graph open set condition is satisfied. So we have 
dimP = dim Q = DimP = DimQ = s, where s = log(v / 2 + 2)/log2 ss 1.7716 
is the sim- value for the M-W graph shown in Fig. 4.3.4(b) 

But that was not the way in which this fractal was originally defined. 
Example (c) in Jun Li’s thesis [43] is described using an iterated function 
system in the plane consisting of four similarities: 


* Optional section, 
t Answer for Exercise 4.3.11. 
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Fig. 7.1.1. A fractal and a surrounding square 
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The attractor is shown in Fig. 7.1.1, and the iterated function system is illus- 
trated in Fig. 7.1.2. 

Deconstruction. We claim that if P, Q are the sets described on p. 126, and 
four of these are assembled into a square F as shown, 



then F is the attractor of the iterated function system 7.1.2. But we know 
that the attractor is unique (Theorem 4.1.3), so we need only prove the self- 
referential equation F = fi [P] U / 2 [P] U / 3 [P] U [P] . We will prove this using 
pictures, which can sometimes be more convincing than words. 
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We should show that 



The outer triangles match as required. The triangles in fa[F] do not align 
with the others, so to see how the union behaves we require another level of 
subdivision. We need to prove that the inner square 
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Everything now matches, taking into account P U P = P and P U Q = P. 

In this example, then, the iterated function system 7.1.1 has sim-value 2, 
but the attractor it defines has fractal dimension log(V2 + 2)/ log 2 ss 1.7716. 


More Self-Similar Sets with Overlap 

Now we consider a “similarity dimension” example with overlap in K, and 
leave the details to the reader. We will use the ratio list (1/5, 1/5, 1/5). 
Let a, b, c be three real numbers. Consider the three dilations /i,/ 2,/3 of M 
with fixed points a, b, c, respectively. For certain choices of the points a, b, c, 
this realization satisfies the open set condition, and the invariant fractal K 
has Hausdorff dimension equal to the sim-value of the ratio list. For certain 
other choices of a, 5, c (such as two or three of them coincident) the Hausdorff 
dimension of K is not equal to the dimension of the ratio list. There is, 
nevertheless, always an inequality. 

Let us normalize things by assuming a = 0,c=l,0<6<l. (Any choice 
of three distinct points can be reduced to this case.) All three of the maps 
send [0, 1] into itself, so the invariant set K is a subset of [0, 1]; in fact K may 
be constructed in the usual way by the contraction mapping theorem starting 
with [0, 1]. 

Exercise 7.1.3. For what values of b are the three images of the open interval 
(0, 1) disjoint? 

Exercise 7.1.4. Compute the Hausdorff dimension when b = 1/10. 

Exercise 7.1.5. Compute the Hausdorff dimension when b = 1/5. 

There is a result of Falconer [24] that is relevant in situations like this. 
In this case it asserts that the Hausdorff dimension of the invariant set K is 
equal to the similarity dimension log 3/ log 5 for almost all choices of & € [0, 1] . 
That is, the set of all b £ [0,1] for which dim A' = log 3/ log 5 fails is a set of 
Lebesgue measure 0. 

Exercise 7.1.6. Give an example of an iterated function system of similarities 
in where the Hausdorff dimension of the invariant set coincides with the 
similarity dimension, but Moran’s open set condition fails. 

Consider the Barnsley leaf fractal B defined on p. 26. It is the attrac- 
tor of an iterated function system with three maps, and sim-value 21og(l + 
V2)/log2. This is > 2, so it is certainly not the fractal dimension of the 
fractal B itself. Plate 10 shows the set B. The three images are in three col- 
ors cyan, magenta, and yellow. Where cyan and magenta overlap, it is blue. 
Where all three overlap it is black. 

Exercise 7.1.7. Deconstruct Barnsley’s leaf, and determine its fractal dimen- 


sion. 
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7.2 *Self-AfRne Sets 

The idea of an iterated function system makes good sense even when the maps 
are not similarities. One possibility that comes up often involves affine maps. 
The invariant set is then said to be self-affine. In the general self-affine 
case the evaluation of the Hausdorff dimension is not completely understood. 
It has even been argued [45] that the Hausdorff dimension is not the proper 
dimension to use at all. We will present a few examples in this section. 

A Self- Affine Dust 

As a reference, take the unit square in M 2 : 

S = { (x, y) : 0<x<1,0<?/<1}. 

The images of S under the two maps will be two rectangles: 

f?i = { (x, y) : 0 < x < 1/2, 0 < y < 2/3 } 

#2 = { {x, y) : 1/2 < x < 1, 1/3 < y < 1 } . 

The function /i is an affine map of M 2 onto itself, and sends the vertices of 
S to the corresponding vertices of R\. The function fa is an affine map of JR 2 
onto itself, and sends the vertices of S to the corresponding vertices of f? 2 - 



Fig. 7.2.1. Self-Affine Dust 
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Fig. 7.2.4. Kiesswetter’s curve 


Exercise 7.2.2. There is a unique compact nonempty set K CM. 2 such that 
K = h[K\ Uf 2 [K\. 

Exercise 7.2.3. Compute the Hausdorff dimension of the set K. 


Kiesswetter’s Curve 


This is illustrated in two different ways. The set can be decomposed into four 
subsets, which are affine images of the whole thing. The four affine maps may 
be written in matrix notation. A point {x, y ) in the plane is identified with a 
2x1 column matrix. 


h 


f 2 

h 

h 


X 


1/4 0 


X 



y. 


0 -1/2 

y. 



X 

y. 

= 

1/4 0 ' 
0 1/2 


c 

i_ 

f 

'1/4 
-V 2 

X 

y. 

= 

1/4 0 ' 
0 1/2 


c 

i_ 

f 

1/2 

0 

X 


1/4 0 ' 


c 

I 

'3/4' 

y. 


0 1/2 


)_ 

r 

1/2 


The first construction starts with the rectangle Mq = [0,1] x [—1,1], and 
at each stage replaces the current set M n with M n+ 1 = fi[M n ] U f 2 [M n ] U 
/ 3 [AT n ] U f 4 [M n ]. Because each of the maps fj sends M 0 to a subset of M 0 , 
this results in a decreasing sequence of compact sets. Kiesswetter’s curve is 
the intersection |"| n M n . 

The second construction starts with the line segment from (0,0) to (1, 1), 
and makes the same transformation as before. Since /j((l,l)) = /j+i((0,0)) 
for j = 1,2,3, these sets are all polygons. They are graphs of a sequence of 
continuous functions defined on [0, 1]; this sequence converges uniformly. The 
limit g is called Kiesswetter’s function. Its graph G = {(x,y) : y = g(x) } 
is Kiesswetter’s curve. 
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Fig. 7.2.5. Kiesswetter’s curve 


Exercise 7.2.6. Let g be Kiesswetter’s function. Then for any integers k > 0 
and 0 < j < 2 fc , prove 



Exercise 7.2.7. Kiesswetter’s curve is the graph of a continuous but nowhere 
differentiable function g : [0, 1] — ► M. 

Exercise 7.2.8. Find the Hausdorff and packing dimensions of Kiesswetter’s 
curve. 

Besicovitch— Ursell Functions 

Besicovitch and Ursell investigated the dimension of the graphs of non- 
differentiable functions. The most famous examples of these functions, dating 
back to Weierstrass, have a form 


OO 



for appropriate choices of a & and b *.. A simpler variant was used by Besicovitch 
and Ursell, which will now be described. 

Define a “sawtooth” function g : M — > R. by: 


g( x) = x for —1/2 < x < 1/2 
g(x) = 1 — x for 1/2 < x < 3/2 
g(x + 2) = g(x) for all x. 


If 0 < a < 1, the Besicovitch Ursell function with parameter a is: 


OO 


/ 0*0 = ^2a k g{2 k x). 
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Partial sums a k g{2 k x) of the series are illustrated in Fig. 7.2.10, with 

a = 0.6. The pictures show only 0 < x < 1, but the rest of the graph is simply 
related to this part. 

Exercise 7.2.11. The function f(x) exists and is continuous. 

Exercise 7.2.12. Is the graph of / the invariant set for some iterated function 
system? 

Pictures for various values of a are shown in Fig. 7.2.15. 

Exercise 7.2.13. For what values of a is / a Lipschitz function? 

Exercise 7.2.14. Compute the Hausdorff dimension of the graph of the 
Besicovitch-Ursell function with parameter a = 3/5. 


Hironaka’s Curve 

Pictured (Fig. 7.2.16) are some approximations to Hironaka’s curve. The 
first approximation consists of two vertical line segments, one unit long, one 
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Fig. 7.2.15. Besicovitch-Ursell Functions 


unit apart. For each subsequent approximation, additional line segments are 
added. The length of the new line segments is decreased by a factor of 1/2 at 
each stage. The distance between the line segments is decreased by a factor 





Fig. 7.2.16. Hironaka’s curve 
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of 1/3 at each stage. The position of the line segments is determined by the 


pattern illustrated. Hironaka’s curve is the limit set. 

Exercise 7.2.17. Find topological and Hausdorff dimensions for Hironaka’s 
curve. 

Number Systems 

Here is a way to generalize the “number systems” of Sect. 1.6. Elements of M d 
should be identified with d x 1 column matrices. Let D be a finite set in R d , 
including 0, and let B be a d x d matrix. What conditions should B satisfy 
so that all of the following vectors exist? 



where the “digits” a 3 - 6 D. The set F of all these vectors is the invariant set 
of an iterated function system of affine maps. 


7.3 *Self-Conformal 


An affine transformation that is not a similarity changes distances by different 
ratios in different directions. Here we will talk about non-affine transforma- 
tions that change distances by the same ratio in all directions, but only in the 
limit near a point. 

Let S' be a metric space, let /: S — > S be a transformation, let r > 0, and 
let a £ S. We say that / is conformal at a with ratio r if: 



More technically stated: for every e > 0 there is <5 > 0 such that for all 
x,y £ S, if g(x,a) < S and g(y,a) < <5, then 


(l-e)rg(x.y) < g(f(x),f(y)) < (1 + e)rg(x,y). 


We say that / is conformal on a set E if a is conformal at every point of E, 
but not necessarily with the same ratio. 

Proposition 7.3.1. Let f: R — > R be continuously differentiable, let a £ R, 
and assume f'(a) 0. Then f is conformal at a with ratio |/'(a)|. 
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Proof. Let e > 0 be given. Then since f is continuous, there is d > 0 so that 
if \x — a\ < S , then 


Now let x,y satisfy \x — a| < 6, \y ~ a\ < 6 , and x ^ y. Applying the Mean 
Value Theorem on the interval from x to y , we conclude there is z between x 
and y so that 

/W ~ f(V) = f(z). 
x-y 


Now \z — a| < <5, so we have 


l/( f / , (y)1 = 1/^)1 <(l + g )|/ / (o)|, 

\ x - y I 

so that 

I f{x) - f(y ) I < (1 + e)\f'(a)\ \x — y\. 

Similarly, 

l/(*) - f(v)\ > (! - ^)l/'(a)| \x-y\. 

Thus / is conformal at a with ratio |/'(a)|. □ 

Is continuity of the derivative required? 

Exercise 7.3.2. Give an example where /: R — > R is differentiable at a point 
a with f(a) ^ 0, but / is not conformal at a. 

If you have studied multi-dimensional calculus, you can attempt the next 
exercise. 

Exercise 7.3.3. Let / : R d — > R d be continuously differentiable, and let a £ 
R d . If the derivative Df(a ), interpreted as as d x d matrix, defines a similarity 
on R d , then / is conformal at a, and the ratio of / at a is the same as the 
ratio of the similarity Df(a). 

In Euclidean space R d , examples of conformal maps (where they are de- 
fined) are: translation, rotation, reflection in a hyperplane, reflection in a 
sphere. In particular, in R 2 , reflection in a circle (p. 28) is conformal. Of 
course the ratio is not the same everywhere. 

In the mathematical subject of complex analysis, you can find this: 

Proposition 7.3.4. Let / : C — > C be continuously differentiable in the com- 
plex sense, let a £ C, and assume f'(a) 0. Then f is conformal at a with 

ratio |/ 7 (a)|. 
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If a function / (defined for complex numbers z) has the form 


/(*) 


az + b 
cz + d 1 


where a, b,c,d £ C, then / is called a linear fractional transformation. 
If ad — be = 0, then / is constant, so we will assume ad — be ^ 0. Then / is 
defined everywhere in C except z = — b/a , and 


/'(*) 


ad — be 
{cz + d ) 2 


is never zero. So / is conformal. An important property of a linear fractional 
transformation is that it maps circles to circles (provided a line is considered 
to be a circle). 

The attractor of an iterated function system consisting of conformal 
maps is known as a self - conformal set. Of course self-similar sets are self- 
conformal. Pharaoh’s breastplate (p. 30) is self-conformal but not self-similar. 


Appolonian Gasket 

Figure 7.3.5 shows a subset of the plane. The first approximation is obtained 
by taking three mutually tangent circles with radius 1. The set Co is the region 
enclosed by three arcs (including the arcs themselves). Each approximation 
will consist of some regions bounded by three mutually tangent circular arcs. 
To obtain Ck+ i, remove from each region of Cfc the circle in the region tangent 
to all three of the arcs. (The boundary of the circle remains.) The Appolonian 
gasket is the “limit” (intersection) of the sets Cfc. 

The gasket is is self-coinformal. It is not self-similar or self-affine. 

Exercise 7.3.6. The Appolonian gasket is an invariant set for an iterated 
function system of linear fractional transformations. 

Exercise 7.3.7. Discuss the topological dimension and fractal dimension of 
the Appolonian gasket. 



Fig. 7.3.5. Appolonian Gasket 
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Julia Set 

Recall the Julia set described on p. 28 for the function ip(z) = z 2 + c, where 
c = —0.15 + 0.72i. The two branches fo,fi of the function sj z — c are not 
continuous on the set J that we construct, so it is not so simply interpreted 
as the attractor of an iterated function system. 

The Julia set J is the union of two sets U and L = —U. See Fig. 7.3.8. There 
are choices of inverse maps fo{z), fi(z) = —fo(z) for tp that are continuous on 
U and choices of inverse maps go(z) 7 g\{z) = — <7 0 (~) for ip that are continuous 
on L so that 


U = f 0 [U}Ug 0 [L}, V = fi[U] U gi[L\. 

See Fig. 7.3.9. The point c does not belong to J, and \Jz — c is conformal ex- 
cept at the point c. All four maps are conformal on their appropriate domains, 
since they are continuous branches of \J z — c. Sets U and L are isometric; U 
is the attractor of the iterated function system consisting of conformal maps 
fo(z),go(—z). So U is self-conformal. 




Fig. 7.3.9. fo[U] and go[L] make up U 
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7.4 *A Multifractal 

Most of the fractals that have been considered in this book are closed sets (or 
even compact sets). We will discuss now an example that is not closed. Man- 
delbrot calls it the Besicovitch fractal. It was studied by Besicovitch and 
Eggleston; more recently it occurs in the physics literature in connection with 
“multifractals” or “fractal measures” The proof will require some knowledge 
of probability theory, however. 

Given x £ [0, 1], consider its binary expansion, x = a *2 - \ where each 

ai is 0 or 1. We are interested in the frequency of the occurrence of the digit 
0. More precisely, let Kn\x) be the number of 0’s and Kn\x) the number 
of l’s occurring among the first n digits, (ai, 02 , • • • , a n ). The frequencies in 
question are 


F(°>(x)= lim 

n — »oo TL 

F™(x) = lim K ™ ){ - X) 


n — kx) n 


(Of course, the limits in question exist for only some x £ [0, 1].) 
Fix a number p , with 0 < p < 1. We are interested in the set 


5 P = | x G [0, 1] : exists, and F^°\x) = p j . 

We will compute the Hausdorff dimension of the set S p . If we write q = 1 —p, 
so that F(°\x) = p implies F^(x) = q, then we will show that 


dim S p = 


-plogp- glogg 
log 2 


The proof will use a string model, as usual. But it will also use the “strong 
law of large numbers” an important result from probability theory. 

Before we turn to the proof, let us consider the set S p more carefully. Note 
that [0, 1] is not equal to (Jq< p <i <Sj» since the limit F(°)(x) does not exist for 
many x. 

I will next prove that the set S p is a Borel set. (This is the first example 
we have seen where measurability is not immediately obvious.) First, given 
a, b, n, the set 

| x : a < (x) < b j 
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is a Borel set, since it consists of a finite number of intervals of length 2 n 
Then 


S p = 


x : lim 

n — kx) 


K^\x) 


= P 


-nun 

ke N NeN n>N 


x ' r ~k- 


i . k!, 0 > (x) 


ip+ k 


so S p is a Borel set. 

Next, note that if x and y agree except in the first k digits, then F^(x) = 
p(°)(y). So any open interval in [0,1] intersects S p . That is, S p is dense in 
[0, 1]. Certainly S p ^ [0, 1], so of course S p is not closed. 

If the digits of x are all shifted to the right, and a new digit is added on 
the left, then the frequencies are unchanged. So S p exhibits a natural self- 
similarity: If x £ [0, 1], then F^°\x) = F^\x/ 2) = F^\l/2 + x/2). Thus the 
two similarities 


have the property 


fo(x) 

AO) 

= f 0 [s p ] u MS P ], 


x 

2' 

x + 1 
2 


with no overlap. The similarity dimension of the iterated function system 
(/ 0 ,/i) is 1. The conclusion is: similarity dimension may be misleading for 
non-closed sets. 

Theorem 7.4.1. The Hausdorff dimension of the set S p is 


-plogp- glogg 
log 2 


Proof. Let E = {0, 1} be our two-letter alphabet, and recall the “base 2” 
model map h : — > [0,1] defined on p. 14. Then h[E^] = [0,1]. Also, 

diam[a] = diam/i[[a]] = 2~ n if a £ E^ n \ We define frequencies for strings in 
the same way as for numbers: For a £ E^**, let K^°\a) be the number of 0’s 
in a, let K^ 1 \a) be the number of l’s in a. For cr £ E^ let 


F ( 0 ) (ct) 

F ( 1 ) (ct) 


lim 

n — >oo 

lim 

n— > oo 


K^ialn) 

n 

K^\a\n) 
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These limits are defined for some strings a £ E^\ and not for others. Let 

F(°)(a)=p}. 

Then clearly S p = h[T p \. 

Now consider a measure M p defined on as follows. Let a £ E^ n \ If 
k = K^^a), n — k = K^^a), let w a = p n ~ k q k . Then w a = wq + wi, so 
these numbers define a metric measure M p on E^ with M p ([a]) = w a for 
all a £ E(*l 

Now we require the result from probability theory. According to the mea- 
sure just defined, the “digits” of a constitute independent Bernoulli trials, 
with probability p of outcome 0 and probability q = 1 — p of outcome 1. So 
by the strong law of large numbers (for example, [7, Example 6.1]), we have 

M p(T p ) = 1, or, equivalently, M P (E ( - UJ ' ) \ T p ) = 0. 

We will take the case p < 1/2. The case p > 1/2 is similar, and the 
case p = 1/2 is the usual measure Mp 2 and dimension 1 computed before 
(Proposition 6.3.1). 

We begin with the upper bound, dim5 p < s. Let e > 0 be given, and let 
N £ N satisfy 2~ N < e. Let q ' < q. We will show that dim S p < s', where 
s' = (-q'logq- (1 - q')logp)/\og2. 

Consider the set G C £?(*) defined as follows: if a € E^ n \ and k = K^(a), 
then a £ G iff k/n > q'. For such a, we have diam[a] = 2~ n and 

M P (W) =p n ~ k q k =p n Q 

q'n 

— p^-l'^qq'n 

= (2 _n ) s = (diam[a]) s . 

Let G' be the set of all a £ G with length |a| > N but a\n ^ G 
for N < n < |a|. That is, a belongs to G, but no ancestors of a (ex- 
cept possibly ancestors before generation N ) belong to G. If a £ T p , then 
limn^oo LfW(er|n)/n = q > q', so for some n > TV we have a\n £ G, and 
therefore for some n > TV we have a\n £ G' . So 

{ [a] : a £ G' } 

is a disjoint cover of T p . But 

5Z (diam[a]) s < M p([ a D 

aeG' aeG' 
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Therefore TC* (T p ) < 1. Let e — > 0 to conclude !K S (T p ) < 1, and therefore 
dim T p < s'. Now when g' -> g we have s' — > s, so dimT p < s. 

Now the model map h has bounded decrease, so dim S p < s. (Or, cover S p 
with the sets /i[[a]] , a £ G'.) 

Next I must prove the lower bound, dim S p > s. Let q' > q , and define 
s' = (—q' log q — (1 — q') logp) / log 2. I will show dim S p > s'. Now 

M p | a £ EM : lim a(1) H») = q J = Mp (T p ) = 1, 


and therefore 


< cr : there exists N £ N such that for all n> N, 


K^^crln) 


< q 
= 1. 


So by countable additivity, 


N — >oo 


lim Mp < cr : sup 


K^\a\n) 


< q 


n>N 


Choose N so that M P (F) > 1/2, where 


rp i K^\a\n) , 

F = l a \ sup < q 

n>N Tl 



Let £ = 2~ N . 

Suppose A is a countable cover of S p by sets A with diam A < e. First, we 
reduce to a cover by intervals of the form h [ [a] ] . Each set A £ A is covered 
by (at most) three of the intervals /i[[a]], where the length |a| is the integer 
n with 2~ n < diam A < 2~ n+1 . Let G C be the set of all these a. (We 
may assume that the sets [a] are disjoint, since if two of them intersect, then 
one is a subset of the other, so we may delete the smaller one.) Thus 

(diam[a]) s <3 (diam A) s 
aeG AeA 

and Tp C UaecW- so UagcW) = L 

For a £ G we have |a| > N. If [a] fl F / 0, |a| = n and K^\q) = k, then 

M p ([a})=p n - k q k =p n Q 

( \ q' n 

C l\ =p (l -q')n q q’n 
= (2“") s = (diam[a]) s . 
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Now if G' = { a € G : [«]nf/0}, then 

\ < Mp(F) < ( U W 

VaSG' 

= J2 M p([ a ]) < J2 ( diam [«]) s 

aeG 1 aGG 1 

< ^2 (diam[a]) s < 3 ^(diamA) 5 . 

ckGG- .<4.£*/l 

Now A is any cover of S p by sets of diameter < e, so (S p ) >1/6. Therefore 

JC S ' (S p ) > 1/6, so dim^p > s'. Now let q' — ► q to obtain dim S p > s. □ 

Exercise 7.4.2. Compute the box dimension (the lower entropy index) of the 
set S p . 

Exercise 7.4.3. Compute the packing dimension DimS’p. 

Exercise 7.4.4. Let £ be a finite alphabet, let M be a metric measure on 
the space of infinite strings, and let g be a metric on E^K Suppose t is 
a positive real number, and let 


If 0 < M(5) < oo, does it follow that dim S = t? 


S = \ cr € E iu) : lim 


logM([cr fn]) 


OO log diam[cr|"n] 


7.5 *A Superfractal 

Examples called “Kline curves” were included in the first edition of this 
book. It was included as extra material that could be assigned to students for 
independent investigation. The Kline curves provide examples of parametric 
curves in the plane where the Lipschitz classes of the two coordinate functions 
and the fractal dimension of the curve itself can be controlled independently. 
Kline’s paper [40] was published in 1945. 

Around 2005, motivated by their study of fractal methods for picture gen- 
eration, Barnsley, Hutchinson, and Stenflo developed the superfractal for- 
malism (see Barnsley’s book [4]). Unexpectedly, the Kline curves give us an 
interesting example of a superfractal. 
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Fig. 7.5.1. Kline rules 


Kline curves 

The Kline curves are subsets of the plane 1R 2 . They are constructed using 
approximation by “Kline polygons’.’ We begin with the line segment from the 
point (0,0) to the point (1,1); it is the diagonal of the rectangle (actually a 
square) [0,1] x [0,1]. There are three rules used to build more complicated 
Kline polygons. Each of them replaces each of the line segments by three 
line segments. Rule a is implemented by subdividing the horizontal dimension 
of the containing rectangle in thirds, and replacing the diagonal by a three- 
part zig-zag, as illustrated. Rule b is implemented by subdividing the vertical 
dimension of the containing rectangle in thirds, and replacing the diagonal by 
a three-part zig-zag. Rule c is implemented by subdividing both the horizontal 
and vertical dimensions by three, and replacing the line segment by three parts 
of itself, inside the three diagonal subrectangles. 

Each Kline polygon is obtained by applying these three rules in some 
order. Each finite string built from the alphabet {a, b, c} may be considered a 
“program” for the construction of a polygon. Several examples are illustrated 
in Fig. 7.5.3. We will write Kline [a] for the Kline polygon corresponding to 
the string a. 

Now let a £ {a, b, c}^ be an infinite string. The Kline curve Kline[cr] 
is the limit of the Kline polygons Kline [cr[/c] as k increases. 
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Kline[acb] 



Fig. 7.5.3. Kline polygons, Kline curve 


Exercise 7.5.2. If er is an infinite string from the alphabet {a,b,c}, then 
Kline [er |" k\ converges in the Hausdorff metric. 

What is the justification of the use of the word “curve”? The “natural” 
parameterization of a polygon Kline [a] with 3 n segments of equal length (n = 
|a|) is obtained by subdividing [0, 1] into 3" subintervals of equal length, and 
mapping each of the subintervals affinely onto the corresponding segment of 
the polygon. 

Exercise 7.5.4. Determine necessary and sufficient conditions on the string 
a for the natural parameterizations of the polygons Kline[cr|"n] to converge 
uniformly to a parameterization of the Kline curve Kline [er] (which will again 
be called the natural parameterization). 

There are two periodic strings er that deserve special mention. They are 
cases where Kline [a] specializes to curves which we have seen before. For the 
constant string ccc • • • , the Kline curve is a line segment. It has Cov = dim = 
Dim = 1. For the period-two string abab - - - , the Kline curve is the Peano 
space-filling curve (p. 70). It has Cov = dim = Dim = 2. Clearly for a general 
string a , the topological dimension Cov Kline [er] is either 1 or 2. And the 
fractal dimension satisfies 1 < dimKline[<r] < 2. 

Exercise 7.5.5. Prove necessary and sufficient conditions on the string a for 
Cov Kline [er] = 1. 

Exercise 7.5.6. Let er £ {a,b, be the program for a Kline curve. For 
n£ff, let a n (cr) be the number of times the letter a occurs in the restriction 
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afn. Similarly, let b n (cr) be the number of times the letter b occurs and c n (a) 
the number of times the letter c occurs. Assume the limits 


= lim 


an(cr) 

n 


f3 = lim 


b n (a) 


7 = lim 


c n (cr) 

n 


exist and a > j3. Show that the Hausdorff dimension of Kline [a] is 

2 ^ + 7 = 2 _ 7 = 2 _ 7 

a + 7 cr + 7 1 — /3 

Exercise 7.5.7. Discuss the packing dimension of a Kline curve. 

Exercise 7.5.8. Let a £ {a, b, assume a,j3 , 7 exist as in Exercise 7.5.6, 
and assume 1 > a > (3 > 0. Let (<p(t) , , t £ [0,1], be the natural 

parameterization of Kline [a]. Show that 


ip £ Lip 



£ Lip 



SuperlFS 

Let us review the definition of a hyperbolic iterated function system and its 
attractor. We have a complete metric space S. We have a finite index set, 
an alphabet, E. For each letter e £ E we have a contractive Lipschitz map 
f e : S — > S. The data of the iterated function system let us define a map 
F: H(S) -> H (5) by 

F{A) = 1J f e [A). (1) 

e£E 

Sometimes we use the same letter (here F) to refer either to the iterated func- 
tion system {f e ) e ^E itself or to the corresponding map ( 1 ) of the hyperspace. 
The attractor for the iterated function system F is the fixed point, the unique 
K £ H(5) satisfying the self-referential equation F(K) = K . The attractor K 
may be described using a string model. The addressing function h : E ^ — > S 
defined on the string space E^ is defined by 

h(a) = lim fa-\n(x), 
n 

where the limit is independent of the point x £ S. The range of h is K . 

A superlFS is, roughly speaking, an IFS where the space is a hyperspace 
H(S') and the maps are themselves IFSs on S. The space S itself is where we 
are interested in describing sets, but there are two layers of data used to do it. 

A more precise description: Let S’ be a complete metric space. Let E be 
a finite set, an alphabet. For each e £ E, let F e be a hyperbolic IFS on the 
space S, so that the corresponding map F e : H(S') — > H(£) is a contractive 
Lipschitz map. (The IFSs F e all act on the same space S, and each one has 
an alphabet and set of maps. Their alphabets may or may not be the same 
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as each other, and probably are not the same as the master alphabet E.) An 
IFS (F e ) e £E constructed in this way is known as a superlFS on S. So we 
define a map F: H(H(5)) -> H(H(S)) by 

F(-A) = 1J F e [A ] . (2) 

eG E 

Sometimes we use the same letter (here F) to refer either to the SuperlFS 
( F e ) e( zE itself or to the corresponding map (2) of the hyperhyperspace. And 
there is an attractor: a unique X £ H(M(5)) satisfying the self-referential 
equation F(3C) = X. This is called a superfractal.* There is, as usual, a 
string model. For any string a £ define 

h(cr) = lim F atn (A). 

n 

This limit is taken according to the Hausdorff metric in H(5), and it does not 
depend on the starting set A £ H(S). So h: E ^ — > .S' is continuous, and its 
range is the superfractal X. 



Fig. 7.5.9. Kline IFSs 


In [4] this is called a “1-variable superfractal” and that book also discusses 
F-variable superfractals. 
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Fig. 7.5.10. Kline superfractal 


The Kline curve construction described above is an example of a superfrac- 
tal. Space S is the unit square. The master alphabet is E = {a, b, c}. The three 
IFSs are shown in Fig. 7.5.9. The superfractal X attractor is made up of all 
the Kline curves. The addressing function is cr i — > Kline [cr]. Some Kline curves 
are shown in Fig. 7.5.10. A family resemblance among the images reflects the 
fact that they all arise from a single superfractal X. 

See Barnsley [4, Chap. 5] and the references there for additional material 
on superfractals. 


7.6 *Remarks 

Robert Strichartz took the modern literary term “deconstruction” for use 
with iterated function systems. Deconstruction of a literary or philosophical 
text may mean finding meanings that were not intended by the original author. 
So deconstruction of the invariant set of an IFS means decomposing it in a 
way different from the one provided by the original iterated function system. 

Karl Kiesswetter’s curve is from [39]. It was proposed as a particularly 
elementary example of a continuous but nowhere differentiable function. 

Theorem 7.4.1 is due to A. S. Besicovitch [5, Part II] and H. G. Eggle- 
ston [21]. Another proof is given by Patrick Billingsley [6, Section 14]. 
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Comments on the Exercises 

Exercise 7.1.4. The open set condition is satisfied, but not by the open set 

(0,1). 

Exercise 7.1.5. The set K is contained in [0, 1], Consider 4 parts of the set: 

A = K n [4/5, 1] 

B = Kn [1/5, 4/5] 

C = K n [4/25, 1/5] 

D = K D [0,4/25], 

and observe that (1/5 )A C C. Show that the graph similarity obeys Fig. 7.6.1, 
and the open set condition is satisfied for the corresponding realization. The 
dimension is log ((-\/5 + 3)/2) / log 5 ss 0.59799; compare it to the upper bound 
obtained from the the ratio list (1/5, 1/5, 1/5), namely log3/log5 « 0.6826. 

Exercise 7.1.7. This is more like a class project than a homework as- 
signment. Of course there are many possible answers. My deconstruction in- 
volves four isosceles right triangluar blocks A,D,H,K and their reflections 
A' ,D' ,H ' , K' with the graph iterated function system and open set condition 
shown in Fig. 7.6.2. The fractal dimensions are « 1.92926. 

Exercise 7.2.7. If g is differentiable at a point a, and x n < a < y n , lima; ra = 
a = liiri y n , x n < y n , then 


g(yn)-g{x n ) ,, , 
lim = g (a). 

n ^°° Vn - X n 


This is false by Exercise 7.2.6. 
Exercise 7.2.8: 3/2. 



Fig. 7.6.1. A graph 
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Fig. 7.6.2. Leaf deconstruction 


Exercise 

Exercise 

Exercise 

Exercise 

Exercise 


7.2.14: [5, Part V], 
7.2.17: [49]. 

7.3.7: [23, Section 8.4], 
7.4.4: [6, Section 14]. 
7.5.6: [40]. 


Life is a fractal in Hilbert space. 
— Rudy Rucker, Mind Tools 


I am a strange loop. 
— Douglas Hofstadter 



Appendix 


Terms 

Terms are listed here with the page number of an appropriate definition. In 
most cases, there is a reminder of the definition here, but for the complete 
definition see the page cited. 

cr-algebra of sets, 147. Collection of subsets, contains the empty set and the 
whole space, closed under complements and countable unions, 
accumulation point of a set, 46. Every ball centered at the point meets 
the set. 

address, 125. Inverse image for the model map. 
affine, 67. /(tar + (1 - t)y) = tf(x) + (1 - t)f(y). 

at most n-to-one, 111. The inverse image of any point consists of at most 
n points. 

base for the open sets, 47. 

bicompact, 61. Every family of closed sets with the finite intersection prop- 
erty has nonempty intersection. 

Borel set, 147. Belongs to the cr-algebra generated by the open sets, 
boundary point of a set, 54. Belongs to the closure of the set and to the 
closure of the complement. 

boundary of a set, 54. The collection of all boundary points of the set. 
Cauchy sequence, 52. g(x n ,x m ) — > 0. 
clopen, 86. Closed and open, 
closed ball, 45. B r (x) = {y £ S : g(y , x) < r }. 
closed set, 46. Contains all of its accumulation points, 
closure of a set, 53. The set together with all of its accumulation points, 
cluster point of a sequence, 51. Every ball centered at the point contains 
infinitely many terms of the sequence, 
compact set, 61. 

complete metric space, 52. Every Cauchy sequence converges, 
concatenation of strings, 11. One string followed by the other. 
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conformal. 234. 
continuous function, 50. 
continuous curve, 67. 

contracting Mauldin-Williams graph, 130. r(e) < 1 after rescaling. 

contracting ratio list, 118. r, < 1. 

contraction, 54. g(f(x),f(y)) < rg(x,y ), r < 1. 

constituent, 171. pair (x,r) representing a ball. 

convergence of a sequence, 51. g{x n ,x) — > 0. 

countably compact, 60. Every infinite subset has at least one accumulation 
point. 

cover, 57. 

covering dimension, 91. 

cycle, 94. A path from a vertex back to itself. 

dense, 6, 53. 

diameter of a set, 45. Supremum of distances in the set. 

dilation, 6, 9. 
directed multigraph, 79. 

distance between two sets, 45. inf { g{a 1 b) : a £ A, b £ B}. 
empty string, 12. The string of length 0. 

Euclidean space, 42. R d . 
fixed point, 54. f(x) = x. 

fractal, 165, 177. indS 1 < dim S'; or indS < dimS = DimS. 
fractile, 24. A region with fractal boundary that tiles the plane, 
graph self-similar, 127. Decomposed according to a Mauldin-Williams 
graph. 

greatest lower bound, 13. The lower bound greater than all others. 

Hausdorff dimension, 168. 

Hausdorff measure, 167. 

Hausdorff metric, 71. D(A, B) = inf { r : A C N r (B) and B C N r (A) }. 

Hausdorff outer measure, 166. 

homeomorphic, 50. Related by a homeomorphism. 

homeomorphism, 50. Continuous in both directions. 

hyperspace, 71. Space of nonempty compact sets. 

infinite binary tree, 10. 

infinite string, 13. 

initial segment, 13. The beginning part of a string. 

interior point, 45. 
invariant list, 127. 
invariant set, 117. 

inverse Lipschitz, 56. g(f(x),f(y)) > Ag{x,y). 
isometric, 48. Related by an isometry, 
isometry, 48. Preserves distance. 

iterated function system, 117. 
large inductive dimension, 106. 

Lebesgue measurable set, 141. 
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Lebesgue measure, 142. 

Lebesgue number of an open cover, 64. 

Lebesgue outer measure, 139. 

lipeomorphism, 56. Ag(x,y) < g(f(x),f(y)) < B g(x,y). 

Lipschitz, 55. g(f(x),f{y)) < B g(x, y). 

Lipschitz order p, 215. g(f(x),f{y )) < B g(x, y) p . 

lower bound of a set, 191. An item < to all elements of the set. 

Mauldin-Williams graph, 127. 
measure, 147. 

mesh of a cover, 93. Maximum diameter of sets in the cover. 

metric, 41. 

metric outer measure, 156. Additive on sets with positive separation, 
metric property, 48. A property preserved by isometry. 

metric space, 41. 

open ball, 45. B r (x) = {y G S : g(y, x) < r }. 

open set condition, 191, 205. 

open set, 46. Every point is an interior point. 

order of a cover, 91. 
packing dimension, 174. 
packing measure, 173. 

path, 79. A sequence of consecutive edges in a graph, 
path forest, 80. Set of all paths of a graph. 

path model, 129. 

Perron numbers, 202. 

prefix, 13. Initial segment of a string. 

ratio list, 117. 

reduced cover class, 149. 

refinement, 86. 

rescale, 129. Replace the metrics by constant multiples, 
right shift, 123. 6 e (a) = ea. 
self-affine, 230. 

separable metric space, 58. Countable dense set. 
separation of sets, 56, 88. 

sequentially compact, 59. Every sequence has a cluster point. 

sim-value, 117, 131. 

similar, 48. Related by a similarity. 

similarity, 48. g(f(x),f(y )) = rg{x,y). 

small inductive dimension, 104. 

space-filling curve, 70. A continuous curve where the range includes a ball. 

strictly contracting, 129. r(e) < 1. 
string model, 122. 

strongly connected, 80. There is a path from any vertex to any other, 
subcover, 57. Subcollection of a cover that is still a cover. 

subsequence, 51. 
superfractal, 245. 
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topological property, 50. A property preserved by homeomorphism. 
trema, 2. part removed in construction. 

two-dimensional Lebesgue measure, 152. 
two-dimensional Lebesgue outer measure, 152. 
ultrametric space, 47. g{x,z) < ma x{g(x,y), g(y, z)}. 
uniform convergence, 66. 
uniformly continuous, 64. 

zero-dimensional, 86. Every open cover has a clopen partition refinement 


Notation 

An index of the notations used in the book. 


Z, set of integers: IX. 

N, set of natural numbers: IX. 

M, set of real numbers: IX. 

(a, b], intervals: IX. 

X \ A, set difference: IX. 

/: X — > Y, function: X. 

/[C], image set: X. 
limsup, upper limit: X. 
liminf, lower limit: X. 

(150) 3 , base 3: 4. 

L + s, translate of a set: 5. 

|a|, length of a string: 12. 

E^ n \ set of strings of length n : 13. 
E^*\ set of finite strings: 13. 

E^ u \ set of infinite strings: 13. 
a\n , prefix of a string: 13. 

A, the empty string: 13. 

[a], set of strings beginning with a: 
14. 

g(x,y), metric: 41. 

R d , d-dimensional Euclidean space: 
42. 

Pi/ 2 , metric for strings: 44. 
diamA, diameter of a set: 45. 

B r ( x), open ball: 45. 
dist(A,.B), distance between two 
sets: 45. 

B r (x ), closed ball: 45. 

(x n ), sequence: 50. 

A, closure of a set: 53. 
dA, boundary of a set: 54. 


Pu, uniform metric: 66. 

6(5, T), space of continuous 
functions: 66. 

D(A,B), Hausdorff metric: 71. 
N r (A), neighborhood of a set: 71. 
H(5), hyperspace: 71. 
g r , metric for strings: 75. 

E, alphabet: 75. 

V, vertices of a graph: 79. 

E, edges of a graph: 79. 

(V,E,i,t), directed multigraph: 79. 
i(e), initial vertex of an edge: 79. 
t(e), terminal vertex of an edge: 80. 
E uv , edges from u to v: 80. 

|a|, length of a path: 80. 

E^ n \ set of paths of length n: 80. 
E(*\ set of finite paths: 80. 

E^ u \ set of infinite paths: 80. 

A u , empty path: 80. 

[a], set of paths beginning with a: 
80. 

Cov 5, covering dimension: 92. 
ind 5, small inductive dimension: 
104. 

Ind 5, large inductive dimension: 
106. 

9 e , right shift: 123. 

(V, E, i, t, r), Mauldin- Williams 
graph: 127. 

£, Lebesgue outer measure: 139. 

£, Lebesgue measure: 142. 
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C . two-dimensional Lebesgue outer 
measure: 152. 

£ 2 , two-dimensional Lebesgue 
measure: 152. 

C d , d-dimensional Lebesgue measure: 
154. 


Examples 

120-degree dragon: 24. 
Appolonian gasket: 236. 
Barnsley leaf: 26. 

Besicovitch fractal: 238. 
Besicovitch-Ursell function: 231. 
Cantor dust: 1. 

Eisenstein fractions: 35. 
fudgeflake: 23. 

Heighway dragon: 20. 

Hironaka curve: 232. 

Julia set: 27. 

Kiesswetter curve: 230. 

Kline curves: 242. 

Koch curve: 19. 


Hausdorff outer measure: 167. 
JC®, Hausdorff measure: 167. 
dim S', Hausdorff dimension: 168. 
T s , packing measure: 173. 

Dim S, packing dimension: 174. 
spradA, spectral radius: 220. 


Li lace: 84. 

McWorter pentigree: 24. 
Menger sponge: 121. 

MW curves: 126. 

Peano curve: 70. 
pentadendrite: 197. 
Pharaoh’s breastplate: 30. 
Schmidt dragon: 198. 
self-affine dust: 229. 
Sierpinski gasket: 8. 
snowflake curve: 20. 
terdragon: 197. 
twindragon: 34. 
two-part dust: 201. 


Reading 

The mathematical theory of fractal geometry, as I understand it, rests on three 
mathematical foundations: metric topology, measure theory, and probability. 
The metric topology in Chap. 2 is almost all that is generally needed for the 
study of fractal geometry. (Some additional “descriptive set theory” may be 
useful.) The measure theory of Chap. 5 is, however, only a part of what is 
needed for the deeper study of fractal geometry. Additional topics related to 
measure theory, such as integration, potential theory, and harmonic analysis, 
will be needed by the serious student of fractal geometry. (Some of the texts 
are [11], [34], [58].) Probability is an important branch of mathematics that 
is used often in fractal geometry. In order to reduce the background required 
here, I have almost entirely avoided using it explicitly. But further study of 
fractal geometry almost surely will require study of probability theory. And 
by “probability theory” I mean the modern subject that depends on measure 
theory, rather than the older version that gets by with calculus alone. (Some 
of the texts are [7], [10].) 
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Here are some suggestions for further reading on the topics considered in 
this book. 

Benoit B. Mandelbrot, The Fractal Geometry of Nature [44]. This is the 
basic reference on fractals, together with a discussion of the applications of 
fractal sets in many branches of science. But it is not as mathematical as 
mathematicians would like, while it is too mathematical for many others. It 
contains many computer-generated pictures. 

James Gleick, Chaos: Making a New Science [31]. This is a non-technical 
account by a New York Times reporter. It is concerned with the scien- 
tific phenomena governed by chaotic dynamical systems. “Chaos” and “frac- 
tals” are not the same field of study, but there are many relations between 
them. 

Michael Barnsley, Fractals Everywhere [3]. This book is an introduction 
to fractal geometry from a somewhat different point of view than the one I 
have used here. A sequel, Superfractals , was published in 2006. 

W. Hurewicz and H. Wallman, Dimension Theory [35]. This is a book on 
topological dimension. It is a bit out of date by now, but it has everything 
you need to know when you restrict attention to separable metric spaces. It 
requires background in metric topology, such as our Chap. 2. Other references 
for topological dimension are [22], [53], [54], [55]. 

K. J. Falconer, Fractal Geometry [26]. This is a modern text on the subject. 
It contains much more material than I have included here. 

C. A. Rogers, Flausdorff Measures [57]. This book is concerned with the 
more technical aspects of the subject. It was written before Mandelbrot’s 
sudden popularity, illustrating the fact that mathematicians had been do- 
ing this sort of thing ever since Hausdorff. But before Mandelbrot, scien- 
tists (including many mathematicians) considered it to be of only minor 
importance. 

H.-O. Peitgen and P. H. Richter, The Beauty of Fractals [56]. This book 
is a discussion of Julia sets, the Mandelbrot set, and related topics, mostly 
without proofs. It contains many computer-generated color pictures; also an 
essay on whether they should be considered to be “art’.’ 

Chandler Davis and Donald E. Knuth, “Number representations and dra- 
gon curves” [13]. Two papers discuss the Heighway dragon (and some other 
dragons) , and how they are related to representations of numbers in complex 
bases. 

Michel Dekking, Michel Mendes France, and Alf van der Poorten, “Folds!” 
[14], (The title was inspired by a movie current at the time.) Heighway’s 
dragon can be generated by folding a strip of paper. A reader who can 
get past the “humor” will learn about this, and many other interesting 
topics. 

William A. McWorter and Jane M. Tazelaar, “Creating Fractals” [50]. 
Instructions for drawing a wide variety of dragon curves, with code in BASIC. 
This is the source for McWorter ’s pentigree. 
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